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Chapter 1 
Newtonian mechanics 



1.1 Reference frames 

An important aspect of the fundamental law of Newtonian mechanics, 

F = ma, (1.1.1) 

is that it is formulated in a reference frame which is either at rest or moving with 
a uniform velocity (the velocity must be constant both in magnitude and in direc- 
tion). Such frames are called inertial frames. A reference frame is a set of three 
axes attached to a point O called the origin. The position of the origin in space is 
arbitrary, but some specific choices are sometimes convenient. For example, when 
describing a system of N bodies it is usually a good idea to place the origin at the 
centre of mass (which will be introduced below). The origin of an inertial frame 
is either fixed or moving uniformly relative to another inertial frame. The orienta- 
tion of the axes is also arbitrary, but some specific choices can again simplify the 
description. For example, when studying the motion of a particle in a gravitational 
field it is convenient to align one of the coordinate axes with the direction of the 
gravitational force. 

The coordinate axes define a set of basis vectors x, y, and z. (These are some- 
times denoted i, j, and fe.) These vectors point in the directions of increasing x, 



y, and z, respectively, and they all have a unit norm: x x = y y = z z = l: 
this property is indicated by the "hat" notation. Relative to a choice of origin O, a 
particle has a position vector r(t) at time t. This is decomposed in the basis as 

r(t) = x(t)x + y(t)y + z(t)z. (1-1-2) 



The functions x(t), y(t), and z(t) are the particle's coordinates relative to the refer- 
ence frame. The coordinates change as t varies, and the particle traces a trajectory 
in three-dimensional space. The central goal of Newtonian mechanics is to deter- 
mine this trajectory, assuming that the force F acting on the particle is known at 
all times. 

The particle's velocity vector is 

dv 

v(t) = = ±(t)x + y(t)y + z{t)z, (1.1.3) 

where we have introduced the notation x = dx/dt = v x ; we shall also use r = dr/dt 
as an alternative notation for the vector v. The particle's momentum vector is 
defined by 

p = mv, (1-1-4) 
where m is the particle's mass. The particle's acceleration vector is 

dv 

a(t) = -77= x(t)x + y(t)y + z{t)z, (1.1.5) 
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Figure 1.1: Two reference frames, So and Si, separated by a displacement b. 

with the notation x = d 2 x/dt 2 = v x = a x . Newton's equation, ma = F, has the 
mathematical structure of a system of second-order differential equations for the 
coordinates x(t), y{t), and z(t). To describe the particle's trajectory, knowing the 
force, it is necessary to integrate these differential equations. 

Suppose that we have two reference frames, So and Si, separated by a dis- 
placement b (see Fig. 1.1). Relative to Si the position vector of a particle is rj.; 
relative to So it is Tq. The transformation between the two position vectors is clearly 
r = b + r± , or 

n = r -6. (1.1.6) 

Suppose now that Si moves relative to So, so that the vector b depends on time. 
Since the position vectors also depend on time, Eq. (1.1.6) should be written as 
t*i(£) = r (t) — b(t). Taking a time derivative produces the transformation between 
the velocity vectors: 

v 1 =v -b. (1.1.7) 

Taking a second time derivative gives us the transformation between the acceleration 
vectors: 

ai=a -6. (1.1.8) 

If So is an inertial frame, then the equations of motion for the particle as viewed in 
So are mao = F. In Si the equations are instead 

max = F — mb. (1.1.9) 

We see that Newton's equation is preserved only if b = 0, that is, if b is a constant 
vector. In this case Si moves relative to So with a constant velocity, and it is also 
an inertial frame. When, however, Si is not inertial, the equations of motion do 
not take the Newtonian form. We have instead Eq. (1.1.9), which can be rewritten 

as 

ma l = F + -Factitious, 

with Factitious = —mb. The second term on the right can be thought of as a 
fictitious force that arises from the fact that the reference frame is not inertial. A 
well-known example is the centrifugal force, which arises in a rotating (and therefore 
non-inertial) frame of reference. 

We now consider a situation in which Si and So are both inertial. We assume, 
in fact, that they share a common origin O, but that they differ in the orientation 
of the coordinate axes. A concrete example (see Fig. 1.2) is one in which Si is 
obtained from So by a rotation around the z axis. In this case the basis vectors X\ 
and yi differ in direction from x and y . Similarly, the particle's coordinates X\{t) 
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Figure 1.2: The frame Si is obtained from So by a rotation around their common z 
axis. 

and y\(i) differ from xo(t) and yo(t). But it is an important fact that the position 
vector r(t) is not affected by the rotation: 

n = xxxx + yiyi + z x z i 
= x *o + yoyo + z io 
= r . 

This conclusion follows simply from the fact that r = r\ = ro is a vector which 
points from O to the particle, independently of the orientation of the reference 
frame. So although the basis vectors and the coordinates all change separately under 
a rotation of the frame, the position vector is invariant. From this observation it 
follows that V\ = Vq = v and a x = a = a: the velocity and acceleration vectors also 
are invariant under a rotation of the reference frame. Similar considerations reveal 
that the vector F is invariant, and we conclude that the form of Newton's equation 
F = ma is not affected by a rotation of the reference frame. (These invariance 
properties are exactly what motived the formulation of Newton's mechanics in terms 
of vectorial quantities.) 



Exercise 1.1. Determine how the coordinates x and y, as well as the basis vectors x and 
y, change under a rotation around the z axis by an angle a. Then show mathematically 
that r is invariant under the transformation. 



1.2 Alternative coordinate systems 

The discussion of the previous section will have made it clear that the Cartesian 
coordinates (x, y, z) play an important role in Newtonian mechanics. We might even 
say that they have a preferred status. The same can be said of the associated set of 
basis vectors x, y, and z. We are aware, however, of situations in which it may be 
advantageous not to use the Cartesian coordinates, but to switch to another, more 
convenient system. What happens then to the formulation of our fundamental law, 
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F = ma! The answer, as we shall see in this section, is that while the law itself 
does not change, its concrete mathematical form may actually look very different. 

To keep things specific we choose here to work in the x-y plane (we set z = 0) 
and to consider a specific example of an alternative coordinate system, the polar 
coordinates r and <p. These are defined by 

x = rcos<j), y = rsm<t>; (1.2.1) 

the radial coordinate r measures the distance from the origin to the particle, and 
4> is the angle relative to the x axis. In terms of the new coordinates the position 
vector is 

r = (rcos<f))x + (r sm<p)y, (1-2-2) 

and it is now a function of r and <j>. We may express this as r — r(r, <fi), and the 
vector r points to the position identified by the coordinates (r, 0). Notice that r is 
the magnitude of the position vector: r • r = r 2 . 

As the particle moves in the plane its coordinates r and vary with time, and 
the particle's velocity vector is v — r, or 

v = (r cos <j> — r0sin0)a; + (r sin0 + rcb cos 0)y. (1.2.3) 

Notice that the magnitude of the velocity vector is not equal to r; instead v ■ v = 
r 2 + r 2 2 . The acceleration vector is then a = v, or 

a = (r cos (j) — 2f(/>sin <f> — rep 2 cos <p — r<j> sin (f)x 

+ (rsincj) + 2r</>cos (f> — r(j) 2 sin0 + rificos 4>)y. (1-2-4) 

As presented here, these vectors are resolved in the Cartesian basis x and y. It is 
more convenient to resolve them instead in the polar basis f and (f>, where 

r = unit vector pointing in the direction of increasing r (1.2.5) 

and 

4> = unit vector pointing in the direction of increasing <\>. (1.2.6) 

It is important to note that these new basis vectors, unlike x and y, are not constant 
vectors: their directions change as we move from point to point in the plane. 

To find an expression for r we observe that by construction, the infinitesimal 
vector 

dv 

Sr = r(r + 5r, 6) — r(r, 6) = — 5r 

or 

points in the direction of increasing r. This means that r must be proportional to 
dr/dr. Looking back at Eq. (1.2.2), we see that this is given by cos<j>x + sm(j>y, 
and we find that this vector already has a unit norm: (dr/dr) ■ (dr/dr) — cos 2 <f) + 
sin 2 (f> — 1- We conclude that 

dr 

r = — = cos a; + sin^y (1-2-7) 
dr 

is the desired basis vector. We proceed similarly to find an expression for <f>. We 
observe that the infinitesimal vector 

dr 

dr = r(r, <f) + 8cf>) — r(r, <f>) = — 6<j> 



points in the direction of increasing (f>. (Be careful: this is a different Sr from the 
one considered before!) This means that must be proportional to dr/d(j), which 
is given by — r sin a; + rcos0y. The squared norm of this vector is (dr/d(f>) ■ 
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(dr/d(ft) = r 2 sin 2 (ft + r 2 cos 2 (ft = r 2 , and to get a unit vector we must divide dr/d<ft 
by r. We conclude that 

1 dr 

(ft = -— = — sin(ftx + cos (fty (1.2.8) 
r o<p 

is the desired basis vector. 



Exercise 1.2. Check that f ■ <ft = 0. 



Let us now work out the components of the vectors r, v, and a in the basis 
(r,cft). According to Eqs. (1-2.2) and (1-2.7) we have 

r r = [(r cos <ft)x + (r sin (f>)y] ■ [cos (f>x + sin (fty] 
= r cos 2 (ft + r sin 2 (ft 
= r. 

Similarly, Eqs. (1.2.2) and (1.2.8) give 

r ■ (ft = [(r cos <ft)x+ (r sin (ft)y] ■ [— sin (ftx + cos (fty] 
= — r sin </> cos </> + rsin^cos^ 
= 0. 

From these results we infer that 

r = rr, (1.2.9) 

and this expression should not come as a surprise, given the meaning of the quanti- 
ties involved. Proceeding similarly with the vectors v and a, we find that they are 
decomposed as 

v = rr + r<ft(ft (1.2.10) 

and 

a= (r-r(ft 2 )f+ --f (r 2 c/>U (1.2.11) 
v ' r at 

in the new basis. As we have pointed out, the components of r in the polar basis 
are obvious, and the components of v also can be understood easily: The radial 
component of the velocity vector must clearly be v r — r 1 and the tangential compo- 
nent must be v§ — r(ft because the factor of r converts the angular velocity (ft into 
a linear velocity. 

The components of the acceleration vector are not so easy to interpret. It is im- 
portant to notice that the radial component of the acceleration vector is not simply 
a r = r, and the angular component is not simply — (ft. It is a general observation 
that the components of the acceleration vector are not simple in nonCartesian coor- 
dinate systems. It should be observed that the radial component of the acceleration 
vector contains both a radial part f and a centrifugal part — rift 2 = —v\jr. 



Exercise 1.3. Verify by explicit calculation that Eqs. (1.2.10) and (1.2.11) are correct. 



Suppose now that the force F has been resolved in the polar basis (r, eft). We 
have 

F = F r r + F 4> 4>, (1.2.12) 

and Newton's law F = ma breaks down into two separate equations, the radial 
component 

r-r<ft 2 = — (1.2.13) 
m 
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and the angular component 

Ur^) = rF± - (1-2-14) 
at ' m 

These are the equations of motion for a particle subjected to a force F, expressed 
in polar coordinates (r, <p) . When, for example, F$ — and the force is purely 
radial, then according to Eq. (1.2.14), r 2 tfi — rv^ is a constant of the motion. 
When, in addition, f = and the particle travels on a circle r — constant, then 
Eq. (1.2.13) reduces to rip 2 — v 2 /r = —F r /m; this is the familiar equality between 
the centrifugal acceleration v^/r and (minus) the radial component of the force 
(divided by the mass). 

Exercise 1.4. Consider the spherical coordinates (r,9,(j>) defined by x = rsin#cos0, 
y — r sin 8 sin 4>, and z = r cos 8. Show that in this alternative coordinate system, the basis 
vectors are given by 

dr 

f = — = sin # cos 4>x + sin 6< sin 4>y + cos 6 z, 
or 

1 dr 

6 — - — — cos 9 cos (b x + cos 8 sin Sy — sin 8 z , 
r ad 

1 1 dr . ± „ 

4> = — — — - = -sm^x + cos0y. 
r sm 8 o(f> 

Verify that these vectors are all orthogonal to each other. 



1.3 Mechanics of a single body 

In this section we explore some consequences of the law F — ma when it applies to 
a single particle. 

1.3.1 Line integrals 

We begin with a review of some relevant mathematics. Let A be a vector field 
in three-dimensional space. (A vector field is a vector that is defined in a region 
of space and which may vary from position to position in that region.) Let C be 
a curve in three-dimensional space, and let ds be the displacement vector along 
the curve. The displacement vector is defined so that ds is everywhere tangent to 
the curve, and such that its norm ds = \ds\ is equal to the distance between two 
neighbouring points on the curve; the total length of the curve is the integral J c ds. 
Now introduce 




A ■ ds, 



the line integral of the vector field A between point 1 and point 2 on the curve C. 
Such integrals occur often in physics. In the present context the force F will play 
the role of the vector field A, and the particle's trajectory will play the role of the 
curve C; we then have ds = dr — vdt and the line integral will be the work done 
by the force as the particle moves from point 1 to point 2. 

It is a fundamental theorem of vector calculus that if a line integral between 
two fixed points in space does not depend on the curve joining the points, then the 
vector field A must be the gradient V/ of some scalar function /. This theorem is 
essentially a consequence of the identity 




ds = f(2) — /(l) independently of the curve, 
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Figure 1.3: Line integrals of a vector field A. 



which is a generalization of the statement J (df /dx) dx = fib) — / (a) from ordinary 
calculus. Another way of presenting this result is to say that if A = V/, then 
§ A ■ ds — for any closed curve C in three-dimensional space. This last statement 
follows because if the curve C is closed, point 2 is identified with point 1, and 
§Vf-ds = f(l)-f(l) = 0. 

To illustrate these notions let us work through a concrete example. Consider 
the vector field A = (x, y) in two-dimensional space. We wish first to evaluate the 
line integral of A along the x axis, from x = — 1 to a; = +1 (see Fig. 1.3). The safest 
way to proceed is to first obtain a parametric description of the curve C, which in 
this case is the line segment that links the points x = We may describe this 
curve in the following way: 

x(u) = — 1 + 2u, y(u) = 0, 

where the parameter u is restricted to the interval < u < 1. (The choice of param- 
eterization is arbitrary; we might just as well have chosen x as the parameter, but it 
is generally a good idea to keep the parameter distinct from the coordinates.) From 
these equations it follows that the displacement vector on C has the components 
dx = 2du and dy = 0, so that ds = (2dw,0). The vector field evaluated on C is 
A = (—1 + 2m, 0), and we have A ■ ds = 2( — 1 + 2u) du. The line integral is then 

/ A-ds= [ 2(-l + 2u)du. 
Jc Jo 

Evaluating this ordinary integral is straightforward, and the result is zero. We 
therefore have 

/ A ■ ds = 

Jc 

for this choice of curve linking the points (x = — 1, y = 0) and (x = 1, y — 0). 

Let us now evaluate the line integral of A along a different curve C which joins 
the same two endpoints (refer again to Fig. 1.3); we choose for C a semi-circle of 
unit radius, which we describe by the parametric relations 



x(9) = — cos 8, y(6)=s'm6, 
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with a parameter 9 running from 9 = to 9 — it. Now we have dx = sin 9 d9 1 
dy = cos 9 d9, and the displacement vector on C is ds = (sm9 d6, cos 9 d9). The 
vector field evaluated on C is A = (— cos 9, sin 6*), and we have A ■ ds = 0. The 
line integral is obviously 




A ■ ds = 



for this choice of curve also. You might experiment with other curves, and invariably 
you will find that J A ■ ds = for all curves C that link the points (—1, 0) and (1, 0) 
in the x-y plane. 



Exercise 1.5. Evaluate the line integral J c „ A ■ ds for the vector field A — (x, y), for 
a curve C" that consists of a line segment that goes from (—1,0) to (0, —1) and another 
line segment that goes from (0, —1) to (1, 0). 



Because the line integral is independent of the path, A must be the gradient 
of a scalar function /. We must have A x = df /dx — x and A y = df /dy = y. 
Integrating the first equation gives 

/ = i.T 2 + unknown function of y, 

where we indicate that the "constant of integration" can in fact depend on y, which 
is held fixed during integration with respect to x. Integrating instead the second 
equation gives 

1 2 

f = -y + unknown function of x. 

These results are compatible only if the unknown function of y is in fact \y 2 ! , and 
the unknown function of . We may still add a true constant to the result, 

and we find that the function / must be given by 

f = \(x 2 + y 2 )+fo, 

where fo = constant. It is then easy to verify that V/ = A. It now becomes clear 
why the line integral had to be zero for any path linking the points (—1,0) and 
(1,0): Irrespective of the path the integral has to be equal to /(1,0) — /(— 1,0) = 
(5 + fo) — (| + fo) = 0, as we have found for C and C . 



1.3.2 Conservation of linear momentum 

We now proceed with our exploration of the consequences of the dynamical law 
F = ma. The first main consequence follows immediately from Newton's equation: 
In the absence of a force acting on the particle, the linear momentum p = mv is a 
constant vector. This follows from the alternative expression of Newton's law, 




(1.3.1) 



if F = then dp/dt = and the vector p must be constant. We therefore have 
conservation of (linear) momentum in the absence of an applied force. 

1.3.3 Conservation of angular momentum 

Relative to a choice of origin O, the angular momentum of a particle at position r 
is defined by 

L = rxp = mrxv. (1.3.2) 
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The angular-momentum vector changes if the origin of the reference frame is shifted 
to a different point in space. The torque acting on the particle is defined by 

N = r x F. (1.3.3) 

(This is also called the moment of force.) We have, as a consequence of Newton's 
equation, dL/dt = m(v xv + rxa) = rxF, since the first term obviously vanishes. 
This gives 

§ = <^> 

and we obtain a statement of angular-momentum conservation: In the absence of a 
torque acting on the particle, the angular momentum L is a constant vector. It is 
clear that TV = when F = 0, but it is possible to have a vanishing torque even 
when F ^ 0; this occurs when F always points in the direction of r. 



1.3.4 Conservation of energy 

The statements of conservation of linear and angular momenta were easy to formu- 
late and prove, but these statements hold only in very rare circumstances: F must 
vanish for p to be constant, and N must vanish for L to be constant. As we shall 
see, the statement of conservation of energy is more difficult to make, but it holds 
much more widely. 

Let a particle move from point 1 to point 2 under the action of a force F. The 
total work done on the particle by the force, as it moves from 1 to 2, is by definition 
the line integral 

Wi2 = J^ F-dr, (1.3.5) 

where dr = v dt is the displacement vector along the particle's trajectory. As we 
shall now infer, the line integral is equal to the total change in the particle's kinetic 
energy, 

T = -mv 2 = kinetic energy, (1.3.6) 

as it moves from 1 to 2. We have introduced the notation v 2 = v ■ v = \v\ 2 . The 
statement of the work-energy theorem is thus 

W 12 = T{2) - T(l). (1.3.7) 

To prove this we substitute F = mdv/dt and dr = v dt inside the line integral of 
Eq. (1.3.5). We get 



f 2 dv 

W\2 = m I — ■ vat. 
Ji dt 



The integrand is 



dv dv x dv v dv x 

— • v = v T H -v,, H v„ 

dt dt x dt y dt v 

1 d 2 1 d 2 li j 



2dt Vx+ 2dt Vy+ 2dt Vz 
1 d ( 2 2 
2dt( V * + < 

d (I 2 
dt{2 V 



and the line integral becomes 

f-2 



W » = L Jt(l mv2 ) dt = l f * = /*T = T(2)-T(l). 
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This is the same statement as in Eq. (1.3.7), and we have established the work- 
energy theorem. 

In very many situations the line integral J t F ■ dr is actually independent of the 
trajectory adopted by the particle to go from point 1 to point 2. In these situations 
we must have that F is the gradient of some scalar function f(r). We write / = —V, 
inserting a minus sign for reasons of convention, and express the force as 

F = -W(r). (1.3.8) 

The scalar function V is known as the potential energy of the particle. When F is 
expressed as in Eq. (1.3.8) the line integral of Eq. (1.3.5) becomes 



W 12 



= -J VV ■ dr = -[V {2) -V {!)]., 



and this is clearly independent of the particle's trajectory: The total work done is 
equal to the difference V"(l) — V(2) no matter how the particle moves from 1 to 2. 
Equation (1.3.7) then becomes V(l)-V(2) = T(2)-T(l), or T(l) + V(l) = T(2) + 
V(2). This tells us that the quantity T + V stays constant as the particle moves 
from point 1 to point 2. We therefore have obtained the statement of conservation 
of total mechanical energy 

E = T + V =^mv 2 + V(r) (1.3.9) 

for a particle moving under the action of a force F that derives from a potential V. 

We can verify directly from Eq. (1.3.9) that the total energy is a constant of the 
motion. We have 

dE _ 1 dv 2 dV 
~dt ~ 2 m ~dT + ~dt' 

As we have seen, 

dv 2 „ dv 
— =2— v. 
dt dt 

The potential energy V depends on time only through the changing position of the 
particle: V = V{r(t)) = V(x(t),y(t), z(t)). We therefore have 

dV dV dx + dVdy dV dz 

dt dx dt dy dt dz dt 

= W-v. 



All of this gives 



dE 
~dt 



= ma ■ v + VT^ • v 
= F -v - F -v 

= 0, 



as expected. 

An example of a force that derives from a potential is gravity: The force 

gravity = mg = m.g(0, 0, -1) (1.3.10) 

is the negative gradient of 

Vgravity = mgz. (1.3.11) 

We have indicated that the vector g points in the negative z direction (down, 
that is); its magnitude is the gravitational acceleration g ~ 9.8 m/s 2 . The total 
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mechanical energy E is conserved when a particle moves under the action of the 
gravitational force. 

An example of a force that does not derive from a potential is the frictional force 

^friction = ~kv, (1.3.12) 

where k > is the coefficient of friction; this force acts in the direction opposite to 
the particle's motion and exerts a drag. It is indeed easy to see that -Ffnction cannot 
be expressed as the gradient of a function of r. (The expression Vf r i c tion — kv • v 
might seem to work, but this potential depends on both r and v, and this is not 
allowed.) This implies that in the presence of a frictional force, the total mechanical 
energy of a particle is not conserved. The reason is that the friction produces heat, 
which is rapidly dissipated away; because this heat comes at the expense of the 
particle's mechanical energy, E cannot be conserved. Energy conservation as a 
whole, of course, applies: the amount by which E decreases matches the amount of 
heat dissipated into the environment. 

It is important to understand that the work-energy theorem of Eq. (1.3.7) is 
always true, whether or not the force F derives from a potential. But whether 
E is conserved or not depends on this last property: When F = -V7 we have 
dE/dt = and the total mechanical energy is conserved; but E is not in general 
conserved when the force does not derive from a potential. 



1.3.5 Case study #1: Particle in a gravitational field 

To illustrate the formalism presented in the preceding subsections we now review the 
problem of determining the motion of a particle in a gravitational field. The force 
is given by Eq. (1.3.10), F = mg = mg(0, 0,-1), and the potential by Eq. (1.3.11), 
V = mgz. The equations of motion are 

x = 0, y = 0, z = -g. (1.3.13) 

These are easily integrated: 

x(t) = x(Q) + v x (0)t, y(t) = y(0) + v v (0)t, z(t) = z(0) + v z (0)t - ^gt 2 . 

(1.3.14) 

These equations describe parabolic motion. Here x(Q), y(0), z(0) are the positions 
at time t — 0, and v x (0), v y (0), and i> z (0) are the components of the velocity vector 
at t = 0; these quantities are the initial conditions that must be specified in order 
for the motion to be uniquely known at all times. The velocity vector at time t is 
obtained by differentiating Eqs. (1.3.14); we get 

v x (t) = v x (0), v y (t)=v y (0), v z (t) = v z (0) - gt. (1.3.15) 

With Eqs. (1.3.14) and (1.3.15) we have sufficient information to compute the total 
mechanical energy E = T + V of the particle. After some simple algebra we obtain 



E = -to 
2 



^(0) 2 + v y (0y + v z (0) 2 ] + mgz(0) (1.3.16) 
for all times t; this is clearly a constant of the motion. 



Exercise 1.6. Verify that Eqs. (1.3.14) really give the solution to the equations of 
motion r = g. Then compute E and make sure that your result agrees with Eq. (1.3.16). 
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1.3.6 Case study #2: Particle in a gravitational field subjected to air 

resistance 

We now suppose that the particle is subjected to both a gravitational force mg and 
a frictional force — kv supplied by the ambient air. For convenience we set k = m/r, 
thereby defining the quantity r, and the total applied force is 

F = m(g -v/t). (1.3.17) 

The equations of motion are ma = F, or a = g — v/t, or again 

r + r/r = g. (1.3.18) 

We assume that the particle is released from a height h with a zero initial 
velocity. The initial conditions are therefore z(0) = h and i(0) = 0. We assume 
also, for simplicity, that there is no motion in the x and y directions. The only 
relevant component of Eq. (1.3.18) is therefore 

v + v/T = -g, (1.3.19) 

where we have set v — z. To arrive at Eq. (1.3.19) we have used the fact that 
g = .g(0,0,-l). 

Our task is to solve the first-order differential equation of Eq. (1.3.19). We use 
the method of variation of parameters. Suppose first that g = 0. In this case the 
equation becomes dv/dt — —v/t or dv/v = —dt/r. This is easily integrated, and we 
get ln(u/c) = — t/r, or v = ce~'/ T . This is the solution for g = 0, and the constant 
of integration c is the solution's parameter. To handle the case j^Owe allow c to 
depend on time — we vary the parameter — and we substitute the trial solution 

v(t) = c{t)e'^ T 

into Eq. (1.3.19). We have v = ce _t / T — v/t and —g = v + v/t = ce~ t l T . The 
differential equation for c(i) is therefore 

c=-ge t / T , 

so that 

c(t) = -STe t/r + cq, 

where Co is a true constant of integration. The result for v(t) is then 

v(t) = -gT + c e-^ T . 

To determine cq we invoke the initial condition v(0) = 0. Because v(0) = —gr + c 
we have that Co = gr. Our final answer is therefore 

v(t) = -gT[l-e~ t/T ]. (1.3.20) 

This is z, the z component of the particle's velocity vector. Integrating Eq. (1.3.20) 
gives z(t), the position of the particle as a function of time. 



Exercise 1.7. Integrate Eq. (1.3.20) and obtain z(t). Make sure to impose the initial 
condition z(0) = h. 



Equation (1.3.20) simplifies when t is much smaller than r = m/k. At such 
early times, when t/r <C 1, the exponential is well approximated by e~ l l T ~ 1 — t/r 
and Eq. (1.3.20) becomes 

v(t) ~ -gt, 
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X 



7. 

Figure 1.4: Motion of a pendulum. For convenience the z axis is taken to point down- 
ward. The angle between the position of the pendulum and the vertical is denoted 9(t). 




Figure 1.5: Forces acting on the pendulum. 

in agreement with Eq. (1.3.15). At such early times the velocity is low, and the 
frictional force is so weak that it has no noticeable effect on the motion. As v 
increases the frictional force becomes more important and it starts to dominate 
over gravity. At late times, when t is much larger than r, the exponential term in 
Eq. (1.3.20) is very small, and the velocity is now approximated by 

v(t) ~ — gr. 

At such late times the velocity is constant: The particle has reached its terminal 
velocity given by ^terminal = 5 T = gm/k. 

1.3.7 Case study #3: Motion of a pendulum 

We now examine the motion of a pendulum, which consists of an object of mass m 
attached to a massless, but rigid, rod of length I. The geometry of the problem is 
illustrated in Fig. 1.4; we shall describe the motion of the pendulum in terms of the 
swing angle 9. 

As shown in Fig. 1.5, there are two forces acting on the pendulum. The first is 
gravity, pulling down, and the second is the tension within the rod, which always 
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pulls in the rod's direction. The geometry of the problem suggests that it might be 
a good idea to involve the polar coordinates introduced in Sec. 1.2. Adapting the 
notation somewhat, we express the Cartesian coordinates x and z of the mass m in 
terms of the new coordinates r and 9; the relationship is 

x = rsa\9 1 z — rcos9. (1.3.21) 

At a later stage of the calculation we will incorporate the fact that the distance 
r between m and the origin of the coordinate system is constant: r = £. For the 
moment, however, we shall pretend that r is free to change with time. 

The polar coordinates (r, 9) come with the basis of unit vectors f and 9, with 

dr 

r = — = sin 9 x + cos 9 z 
or 

and 

s 1 dr 

6 = - — — = cos 9 x — sin 9 z , 
r o9 

where r(r,9) = rsm9x + rcos9z is the position vector expressed in terms of the 
polar coordinates. The unit vector r points in the direction of increasing r (always 
away from the origin), while the unit vector 6 points in the direction of increasing 
9. 

As we have seen in Sec. 1.2, the acceleration vector of the mass m can be 
expressed in the polar coordinates and resolved in the new basis vectors. Repeating 
the calculations carried out there, we find 

a = (r-r9 2 )r + ~ t {r 2 9)6. (1.3.22) 

The net force acting on the mass m is F = T + mg, the vectorial sum of the tension 
and gravitational forces, respectively. Because the tension is directed along the rod, 
we have T = —Tr, with T denoting the magnitude of the tension. The force of 
gravity, on the other hand, is directed along the z direction, and we have rag = mgz. 
Resolving this in the new basis (Fig. 1.5), we have mg = mgcos9r — mgsm9 7 
and the net force is 

F = (— T + mg cos 9)r — mg sin 9 0. (1.3.23) 
Equating this to ma produces 

m(r — r9 2 ) = —T + mg cos 9, - (r 2 9) = —g sin 9, 

the equations of motion for the pendulum. 

These equations simplify considerably when we finally incorporate the fact that 
r = I and does not change with time (so that r = r = 0). The first equation gives 
us an expression for the tension: T = m(£9 +g cos 9). The second equation reduces 
to £9 = —gsin9, or 

9 + uj 2 sm9 = 0, (1.3.24) 

where 

w = \fgU (1.3.25) 
has the dimensions of inverse time (or frequency). 



Exercise 1.8. Make sure that you can reproduce all the algebra that goes into the 
derivation of Eqs. (1.3.24) and (1.3.25). 



Exercise 1.9. Equation (1.3.24) can also be derived on the basis of Eq. (1.3.4), 
dL/dt = N, where L = rar x v is the pendulum's angular momentum and N — r x F 
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the net torque acting on it. Work through the details and verify that this equation does 
indeed lead to Eq. (1.3.24). This method of derivation does not require the new basis of 
unit vectors; all calculations can be carried out in the Cartesian basis. 



The second-order differential equation of Eq. (1.3.24) determines the motion of 
the pendulum. It can immediately be integrated once with respect to time. The 
trick is to multiply Eq. (1.3.24) by 0; this gives 



Now note that 



and 



We therefore have 



89 + {uu 2 sin 0)0 = 0. 

M= l -U 2 
2dt 



(sin 0)0 = -^cos0. 



or 



dt\2 

^0 2 — uj 2 cos = e = constant. (1.3.26) 

This is a first-order differential equation for 0(i). 

It seems intuitively plausible that the conserved quantity e should have some- 
thing to do with the pendulum's total energy E. This is indeed the case. The kinetic 
energy is T — \m{x 2 + z 2 ) = ^ml 2 9 2 , according to our previous results. The po- 
tential energy associated with the gravitational force is V = —mgz = — mg£ cos 9 = 
— m£ 2 uj 2 cos 0, where we have used Eq. (1.3.25). The potential energy associated 
with the rod's tension is zero: The tension always acts in the rod's direction, which 
is always perpendicular to the direction of the motion; the tension does no work on 
the pendulum. We finally have E = T + V = ml 2 {\9 2 - uu 2 cos0), or 

E = ml 2 e. (1.3.27) 

We shall call e the pendulum's reduced energy. Similarly we shall call \9 2 the 
reduced kinetic energy and v{9) = —uj 2 cos the reduced potential energy. 

The qualitative features of the pendulum's motion can be understood without 
further calculation, purely on the basis of the following graphical construction. We 
draw an energy diagram, a plot of the reduced potential energy u(9) = — u) 2 cos9 
as a function of 0, together with the constant value of the reduced energy e (see 
Fig. 1.6). According to Eq. (1.3.26), which we rewrite as 

]^9 2 =e-v{9) 1 u{8) = -w 2 cos0, (1.3.28) 

the difference between e and v(9) is equal to the reduced kinetic energy \9 2 . For 
motion to take place this difference must be positive, and a quick examination of 
the diagram reveals immediately the regions for which e — v(9) < 0. Motion is 
possible within these regions, and impossible outside. 

For example, when e < lo 2 we see that the motion of the pendulum takes place 
between the two well-defined limits = ±0o; motion is impossible beyond these 
points. This situation corresponds to ordinary pendulum motion: The weight oscil- 
lates back and forth around the horizontal axis (0 = 0), with an amplitude 6q. The 
diagram reveals that the angular velocity |0| is maximum when the weight crosses 
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-4 -2 2 4 



e 

Figure 1.6: Energy diagram for the pendulum. The difference between the line e = 
constant and the curve u{9) = — lo 2 cos 8 is the reduced kinetic energy \d , which must be 
positive for motion to take place. The lower value of e is such that e < to 2 . The higher 
value is such that e > oj 2 . In the plot lo 2 is set equal to 1. 

9 = 0, and that the pendulum comes to a momentary rest (9 = 0) when 9 = ±# - 
This amplitude is determined by setting 9 = in Eq. (1.3.28); we have 

e = v{6q) = -cj 2 cos6» . (1.3.29) 

This equation can be solved for 9q whenever e < to 2 ; there are no solutions otherwise. 
When e > u) 2 the diagram reveals that there are no intersections between the line 
e = constant and the curve v{9). There are no points at which ^9 2 = 0, 9 is 
allowed to increase without bound, and the motion is not limited. This high-energy 
situation corresponds to the weight doing complete revolutions around the pivot 
point. 

Points in the energy diagram at which the line e = constant meets the curve v(&) 
are called turning points. At these points the reduced kinetic energy ^9 drops to 
zero and 9 changes sign, either from the positive to the negative (if 9 was increasing 
toward 9 ), or from the negative to the positive (if 9 was decreasing toward — 6q). 
These are the points at which the pendulum reaches its maximum angle and turns 
around. 

Combining Eqs. (1.3.28) and (1.3.29) gives 

^9 2 = uj 2 (cos9- cos6» ), (1.3.30) 

and this is a first-order differential equation for 9(t). This equation, unfortunately, 
cannot be solved in closed form, unless #o is assumed to be very small (we shall deal 
separately with this simple case at the end of this subsection). The best we can do 
is to express t in terms of an integral involving 9. First we take the square root of 
Eq. (1.3.30), 

9 = ±\/2w\/cos 9 — cos 9q, 
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Figure 1.7: Motion of a pendulum with four different amplitudes. We plot the swing 
angle 9 (in radians) as a function of time. The unit of time is 2n/u>. Notice that the period 
of oscillation increases with the amplitude. Notice also that for high amplitude, the curve 
differs significantly from a sinusoid. 



and we solve for dt. After integration we get 



t = ± 



y/2uj J \f< 



dd 



constant. 



'cos e/ — cos tfo 



(1.3.31) 



This integral must be evaluated numerically, and the result t{9) must be inverted 
to give 9(t); the inversion must also be done numerically. To obtain these details 
requires some labour, and this will not be pursued here. The results of a numerical 
integration are presented in Fig. 1.7. 

The motion of the pendulum is clearly periodic, and Eq. (1.3.31) allows us to 
calculate the period P, the time required for the pendulum to complete a full cycle 
of oscillation {9 going from — 9 to +9 Q and then back to — 9 .) This is twice the 
time required to go from —9 to +9 Ql or four times the time required to go from 
9 = to 9 = 9 . So the period is given by 



P = 



d6 



V2uj Jo V cos 9 — cos 9 
To put this integral in standard form we change the variable of integration to 



sin : 



sin jc'o 



and introduce the parameter 

Simple manipulations reveal that 
dz \f\ — s 2 z 2 



s = sin S oq . 



(1.3.32) 



d8 



2s 



\J cos 9 — cos 9 = V2s\/ 1 — z 2 , 
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and the expression for P becomes 



P=-K(s 2 ), (1.3.33) 



where 



K{s 2 ) = ( dz = (1.3.34) 

is a special function known as the complete elliptic integral of the first kind. A plot 
of this function is shown in Fig. 1.8. While this result is perhaps not too revealing, 
it allows us to conclude that the period increases with the amplitude of the motion. 
This follows because P depends on s 2 = sin 2 |#q through the elliptic integral. 



Exercise 1.10. Make sure that you can reproduce all the algebra that goes into the 
derivation of Eqs. (1.3.33) and (1.3.34). 



We can be more explicit when s = sin^g is fairly small compared with 1. In 
this situation it is known that the elliptic integral can be approximated by 



K = 



1 



1 • 3 

2^4 



1- 3-5 

2- 4-6 



Substituting this into Eq. (1.3.33) gives 



P 



2ir 

U) 



1 + -s l + —s q 
4 64 



25 ^ 
256'' 



(1.3.35) 



When the oscillations are very small, that is when 9q -C 1, we have that s 2 -C 1 
and the period is well approximated by the leading term in the power expansion, 
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P ~ 2tt/lo. In this limit the period becomes independent of the motion's amplitude. 



Exercise 1.11. It is not too difficult to derive the preceding approximation to the 
elliptic integral. When s 2 is small the factor (1 — s 2 z 2 )~ 1 / 2 inside the integral of Eq. (1.3.34) 
can be expressed as a Taylor series about s = 0. Show that this gives 

/-, 2 2x-l/2 ,122.344 5 6 6, 

(1 - s z ) ' = 1 + -s z +-sz + i6 SZ H • 

With this expansion the elliptic integral becomes 

f 1 dz 1 2 f 1 z 2 dz 3 4 f 1 z A dz 5 6 f 1 z 6 dz 

Jo VT^ + 2 S Jo VT^ + 8 S Jo ^^^ + 16 S J + " ' ' 

Evaluate these integrals and verify that your result agrees with the expression quoted in 
the text. 



The case of small oscillations is particularly simple to deal with. Go back to 
Eq. (1.3.24), 6 + uj 2 sinO — 0, and assume that is so small that sin6> is well 
approximated by 0. The equation simplifies to 

6 + uj 2 9 = 0, (1.3.36) 

and we have simple harmonic motion. The general solution to this equation is 

e(t) = 6 cos(wt + 6), (1.3.37) 

where 9 is the amplitude and S the initial phase. The solution reveals that the 
period of the motion is P = 2n /uj, in complete agreement with our previous results. 

1.4 Mechanics of a system of bodies 

1.4-1 Equations of motion 

Generalizing the discussion of the preceding section, we now consider a system of 
N bodies subjected to their mutual forces. For simplicity we assume that there are 
no external forces acting on the particles; these would originate from outside the 
system. Each particle in the system is labeled by a number A = 1, 2, 3, • • • , N. The 
motion of body A is governed by the equation 

m A a A = F A , (1-4.1) 

where m A is the mass of the body, a A its acceleration, and F A is the force acting 
on the body due to all other bodies. Relative to an arbitrary choice of origin O, the 
position vector of body A is r A (t) , its velocity is v A (t) = r A , and its acceleration 
is a A (t) =v A = r A . 

The force acting on body A can be expressed as a sum of individual forces 
exerted by each other body. We write 

F A = Fab - ( 1A2 ) 

Here, F A b is the force exerted on A by B; the sum over B obviously excludes A 
because a body does not exert a force on itself. We assume Newton's third law, 
which states that 

F BA - -F AB - (1.4.3) 

In words, the force exerted on B by A is equal in magnitude and opposite in direction 
to the force exerted on A by B. Suppose, for example, that the force exerted on A 
by B is repulsive; then the force exerted on B by A will also be repulsive, and it 
will point in the opposite direction. 
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1.4-2 Centre of mass 
The centre of mass of a system of N bodies is at a position R which is defined by 

A 

where 

M = ^2m A 

A 

is the total mass of the system. 

The centre of mass moves in accordance with Newton's 

MR = m^a-A 

A 

= E^ 

A 

E Fab > 

A,B,A=£B 

where we have used Eq. (1.4.2). In the last line we sum over both A and B (both 
from 1 to N), but we make sure to exclude all terms for which A = B. Let us 
examine the double sum in the special case of three particles. We have 

N N 

E Fab = EE Fab 

A,B,A^iB A=1B=1 
N 

= ( Fai + Fa2 + Fa z) 

A=l 

= (-F21 + -F31) + (Fl2 + F32) + (-Fl3 + -F23) 

= (-F21 + F 12 ) + (F31 + J13) + (F 32 + F23) 
= 0. 

The double sum vanishes by virtue of Newton's third law, and this property remains 
true for arbitrary values of N. We therefore have 

R = 0, => R(t) = -R(O) + R(0)t. (1.4.6) 

The centre of mass moves with a uniform velocity, and it therefore defines the origin 
of another inertial frame. 

It is usually convenient to shift the origin of the reference frame to the centre of 
mass, by defining new positions vectors r' A (t) according to 

r' A = r A - R. (1.4.7) 

It should be kept in mind that the centre of mass defines the origin of an inertial 
frame only when there are no external forces acting on the particles. When external 
forces are present each particle moves according to itiacla = Fjj ntornal + f eternal ^ 
where the first term represents the internally-produced force acting on A, and the 
second term represents the external force. It is then easy to show that the centre of 
mass will move according to MR = J2a F| xternal ; it is accelerated by the net sum 
of all the external forces. 



(1.4.5) 
law, which implies 



Exercise 1.12. Prove the preceding statement. 
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1.4-3 Total linear and angular momenta 
The total linear momentum of the system of N bodies is defined by 

P = ^p A = J2™ava, (1-4.8) 

A A 

where pa are the individual momenta. We have 

P=j t ^AVA, 

A 

or, according to Eq. (1.4.4), 

P = MR. (1.4.9) 

The total momentum therefore follows the motion of the centre of mass. Because 
R(t) = R(0) according to Eq. (1.4.6), we have the important statement that the 
total linear momentum is a constant vector. If the origin of the inertial frame is 
at the centre of mass, then R = and R — 0; this means that P = 0. In this 
centre-of-mass frame, the total momentum of the system of particles is zero. 
The total angular momentum of the system is 

L = ^ r A X p A = m ATA X V A - (1.4.10) 

A A 

Its rate of change is calculated as 

L = Y m A( v A x v A + r A x a A ) 

A 

= Y2 taxFa 

A 

ta x Fab > 

A,B,AjiB 

where we have again involved Eq. (1.4.2). Let us examine the double sum for the 
special case of three particles. We have 

Y r A x F AB = Y( TaX Fai +rAX Fa ^ + TA x Fa $) 

A,B,A^B A 

= (r 2 x F 2 i +r 3 x F 31 ) + (n x F 12 + r 3 x F 32 ) 

+ (n x Fi 3 + r 2 x F 23 ) 
= (ri - r 2 ) x F 12 + (ri - r 3 ) x F 13 + (r 2 - r 3 ) x F 23 , 

where we have used Eq. (1.4.3). The vector r*i — r 2 is directed from body 2 to body 
1. In most circumstances the force i*i 2 also is directed from body 2 to body 1 (or in 
the opposite direction). Under these conditions the vector product (r\ — r 2 ) x F\ 2 is 
zero, and this is true for all other pairs of bodies. The double sum is therefore zero. 
These considerations generalize to an arbitrary number of bodies, and we conclude 
that 

L = (1.4.11) 

whenever the force Fab points in the direction of the relative separation — 
r B . Under these conditions we have conservation of the system's total angular 
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momentum. 



Exercise 1.13. Calculate dP/dt and dL/dt when there are also external forces acting 
on the particles. 



Let us express the position vector of body A as in Eq. (1.4.7), 

r A = R + r' A , 

where r' A is its position relative to the centre of mass. We write, similarly, 

v A = R + v' A 

We make these substitutions into Eq. (1.4.10), and get 
L = ^m A (R + r' A )x(R + v' A ) 

A 

= m A (Rx R + Rx v' A + r' A x R + r' A x v' A ) 

A 

= (Rx R) m A + Rx m Av' A — R x ^2 m Ar' A + ^ m A r' A x v' A . 

A A A A 

This mess simplifies. For the first term on the right-hand side we have J2a mA = 
the total mass of the system. In the second term we recognize that ^2 A m A v' A is 
the system's total momentum as measured in the centre-of-mass frame; this is zero. 
The third term vanishes also, and we finally have 

L = MR x R + ^2m A r' A x v' A . (1.4.14) 

A 

In this expression, the first term represents the angular momentum of the centre 
of mass, while the second term is the total angular momentum of the system of 
particles relative to the centre of mass. When the origin of the inertial frame is 
placed at the centre of mass, we have R — and the first term disappears. In 
general, we see that L depends on the choice of origin. 



(1.4.12) 



(1.4.13) 



1-4-4 Conservation of energy 

The presentation here parallels closely our discussion of Sec. 1.3.4 on energy con- 
servation for a single particle. The notation of this section, however, will be slightly 
more cumbersome, because we now have to keep track of many particles. 

We begin by calculating the total work done on all the particles as they move 
from a configuration labeled 1 to another configuration labeled 2. (This means that 
in the interval of time over which we follow the particles, each moves from a point 
1 to a point 2 on its trajectory.) This is 

Wi 2 = J2 I F A -dr A = Y.I v a dt, (1-4.15) 

A A 

where dr A = v A dt is the displacement vector on the trajectory of particle A. 
Substituting the equations of motion (1.4.1) gives 

W 12 = [ m A d ^ ■ v A dt. 
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But since va • dvA/dt = \dv\jdt, where v\ — v A • va, this becomes 

where Ta — \mAv\ is the kinetic energy of particle A. Introducing the total kinetic 
energy of the system 

T = Y J T A = Y.\ m ^A, (1-4-16) 

A A 1 

we have obtained the statement of the work-energy theorem, 

W 12 = T(2) - T(l). (1.4.17) 

In words, this states that the total work done on all the particles is equal to the 
difference in total kinetic energy between the configurations 2 and 1. 

Exercise 1.14. Express the total kinetic energy of the system in terms of the centre- 
of-mass quantities R, R and the relative quantities r' A , v' A . You should find an expression 
analogous to Eq. (1.4.14). 

To proceed further we shall assume that the mutual force Fab can be derived 
from a potential Vab = Vba that depends only on the distance tab between the 
bodies A and B. We shall therefore have 

Vab = V AB (r AB ), r AB = \r AB \, r A B = r A -r B . (1.4.18) 

The force acting on A exerted by B is given by 

Fab = -VaVab, (1-4.19) 

where Va = (d/dxA,d/dyA,d/dzA) is the gradient operator with respect to the 
coordinates ta = (xA,yA, za) of body A. Similarly, the force acting on B exerted 
by A is 

F BA = -VbVab, (1-4.20) 

where us the gradient operator with respect to the coordinates r B = {x B , yB, z B ) 
of body B. (To be fully symmetrical we might have written Fba = —^bVba, but 
this produces the same result because Vba is by definition equal to Vab-) 

Let us verify that Fba — —Fab and that the forces are directed along the vector 
TA—r Bl that is, in the direction of the relative separation between the two bodies. 
Let us examine, say, the x component of Fab- According to Eq. (1.4.19) we have 

d 

Fab.x = — q — Vab- 
dx A 

Because Vab depends on xa only through its dependence on the distance rAB, we 
apply the chain rule to evaluate the partial derivative: 

dV A B dr A B _ T/ , dr A B 

Pab.x — ^ — —vab~^ ' 

dr A B ox A ax A 

where the prime indicates differentiation with respect to rAB- To calculate the 
partial derivative of rAB with respect to xa we start with the definition 

r AB = ( X A - x B ) 2 + (y A - y B ) 2 + (z A - z B ) 2 - 
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Differentiating both sides gives 



dr AB 



2r A B^ = 2(xa ~ x B ), 

dx A 

and finally, 

dr A B _ x A - x B 
dx A r AB 

Returning to our main calculation we find that the x component of the force is 

x A - x B 

r AB,x — v ABi 

TAB 

and very similar calculations would reveal also the y and z components. The com- 
plete vectorial expression is 

F AB = - r -^V AB , V AB = d ^. (1.4.21) 
tab ar AB 

This shows that F AB is indeed directed along r AB = r A — r B . 

We now calculate F BA . Looking also at its x component we get from Eq. (1.4.20) 
that 

dV AB dr AB _ dr AB 
dr AB dx B dx B 

Repeating the same steps as before we find that 

dr AB _ x A - x B 
dx B r AB 

which differs by a sign from the preceding expression for dr AB /dx A . We finally 
obtain 



_ x A - x B 

?BA,x - V AB 

"TAB 



and the vectorial generalization 

Fba = —V AB . (1.4.22) 
rAB 

This also is directed along r AB = r A — r B . Comparing Eqs. (1.4.21) and (1.4.22) 
shows that, as required, F BA = —F AB . 

The calculations presented above are important and they occur frequently. To 
go through them with some efficiency it is useful to memorize the rule \7 B V AB — 
— VaVas, which is valid whenever V AB depends on r A and r B only through its 
dependence on r AB = \r A — r B \. 

Having made our assumptions regarding the mutual forces F AB , we now return 
to the work integral of Eq. (1.4.15). Substituting Eq. (1.4.2) gives 



W 12 = ^2 f ab ■ dr A . 

A,B,A^B Jx 



To examine this we again specialize to the case of three particles. We have 

f-2 



Wl2 



= J (F 2 i ■ dr 2 + F 3 i ■ dr 3 + F 12 ■ dn + F 32 ■ dr 3 + F 13 ■ dr x + F 23 ■ dr 2 ) 
= J F 12 - (dr x - dr 2 ) + F 13 ■ (dn - dr 3 ) + F 23 ■ (dr 2 - dr 3 ) 
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But dri — dr 2 = d(r\ — r 2 ) = dr i 2 , so this can be expressed as 

J (F 12 ■ dr 12 + F 13 ■ dr 13 + F 23 ■ dr 23 ) . 



r 2 

W 12 



At this stage of the derivation we incorporate the fact that the mutual forces 
are derived from a potential. As we have seen, F\ 2 = — V1V12, where Vi = 
(d/dxi,d/dyi,d/dzi). But since V\ 2 depends on {xi,y\,z\) only through its de- 
pendence on (xi2, 2/12, z\ 2 ) (where, for example, x\ 2 = x\ — x 2 ), the force can also 
be expressed as F 12 = — V12V12, where Vi 2 is the gradient operator with respect 
to r 12 = (x 12 ,y 12 ,z 12 ) 7 



12 



d d d 



dxi 2 dyi 2 dz 12 



This is possible because dx\ 2 /dxi = 1, and so on. 
So we now have 



W 12 



= J (-V12V12 • dr X2 - ^\-iV\z ■ dr i3 - V 23 V 23 ■ dr 23 ). 



Each integral can be evaluated (refer back to Sec. 1.3.1), giving 

^12 = ~[V 12 (2) - V 12 (l)] - [V 13 (2) - V 13 (l)] - [V 23 (2) - V 23 (l)] 
= -[^12 + ^13 + ^23]?- 
Since V21 = V\ 2 and so on, we may write this as 

W 12 = —[Vw + V 13 + V 21 + V 23 + V 31 + V 32 ] \, 

where we now sum over all possible pairs of indices, provided that each index is not 
repeated. Generalizing to an arbitrary number of particles, this is 



__ n 2 

W12 



A,B,A^B 

We define the total potential energy of the system to be 

V=\ E (1.4.23) 

A,B,A^B 

With this notation our previous result is W\ 2 = — [V(2) - V(l)], and Eq. (1.4.17) 
becomes -V(2) + V(l) = T(2) - T(l) or T{1) + V(l) - T(2) + V(2). 

We have finally established that the total mechanical energy of the system, 

E = T + V = Y,\™av 2 a + \ E V ab{vab), (1-4.24) 

A A,B,A^B 

stays unchanged as the particles move from configuration 1 to configuration 2. We 
recall that the mutual potentials Vab are assumed to depend on tab = \ta — t b\ 
only; the mutual forces are then given by Eqs. (1.4.21) and (1.4.22). This is the 
statement of energy conservation for a system of particles. 



Exercise 1.15. Starting from the definition of Eq. (1.4.24), prove directly that dE/dt = 
0. 
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1.5 Kepler's problem 

To give concreteness to the formal developments of the preceding section we exam- 
ine, in this section, the specific situation of two bodies subjected to their mutual 
gravitational forces. This could be the Earth-Moon system, or the Sun- Jupiter sys- 
tem, or again a binary system of two main-sequence stars. Our goal is to determine 
the motion of the two bodies, that is, to find a solution to Kepler's problem. 



1.5.1 Gravitational force 

The force acting on body 1 due to the gravity of body 2 has a magnitude Gm\m 2 /r\ 2 , 
where G is Newton's gravitational constant, mi the mass of body 1, m 2 the mass of 
body 2, and r 12 is the distance between the two bodies. The force is directed along 
the vector r 2 — ri, which points from body 1 to body 2. Introducing the notation 

r = r 1 -r 2 , r = \n - r 2 \ = r 12 , (1.5.1) 

we write 

F 12 = -Gm im2 ^. (1.5.2) 

The force acting on body 2 due to the gravity of body 1 is 

F 21 =Gm 1 m 2 ^ 1 (1.5.3) 

and it is directed along ri — r 2 , which points from body 2 to body 1. 
These forces can be derived from a mutual potential 

V 12 = -°^. (1.5.4) 
r 

This means that the force of Eq. (1.5.2) is given by 

F 12 = -Vi^ia, (1.5.5) 

where Vi is the gradient operator with respect to the coordinates r\ — (xi, y\, z\) 
of body 1. Similarly, the force of Eq. (1.5.3) can be expressed as 

F 21 = -V2V12, (1.5.6) 

where V 2 is the gradient operator with respect to the coordinates r 2 = (x 2 ,y 2 ,z 2 ) 
of body 2. To verify these statements, let us calculate, say, the z component of F 21 . 
We have 

dV 12 dV 12 dr 

^21. z — q — 1 T. • 

oz 2 dr oz 2 

The first factor is 

dV\ 2 Gm\m 2 
dr r 2 

and to calculate the second factor we start with 

r 2 = (xi - x 2 f + (y 1 - y 2 ) 2 + (z 1 - z 2 f 
and differentiate both sides with respect to z 2 . This gives 

dr 

2r— = -2( Z1 z 2 ) 

or 

dr Z\ — z 2 



dz, 



2 
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So finally, 

F 2 i, z = Gm 1 m 2 — — — , 

and this is clearly compatible with Eq. (1.5.3). Similar calculations would return 
all other components of F 2 i and all components of F 12 , and Eqs. (1.5.5) and (1.5.6) 
would be fully verified. 

According to Eq. (1.4.23), the total potential energy of the two-body system is 

v = \ £ v AB = 1 -{v 12 + v 21 ), 

A,B,A^B 

or 

V = V 12 . (1.5.7) 

This result will allow us, in the following subsections, to omit the label "12" from 
the mutual potential; we shall write, simply, V\ 2 = V = — Gvn\m 2 jr. 

1.5.2 Equations of motion 

Newton's equations for the two bodies are m\r\ = F 12 = —Gmim 2 r/r 3 and m 2 r 2 = 
F 2 i = Gm\m 2 rjr 3 . Simplifying, we arrive at 

ri = -Gm 2 J (1.5.8) 



and 

f 2 = Gm 1 J, (1.5.9) 



where, we recall, r = r\ — r 2 and r = \r\. 

The position vectors r-y and r 2 can be expressed in terms of R, the position of 
the centre of mass, and r, the relative position. We have, according to Eq. (1.4.4), 
MR = rairi + m 2 r 2 , where M = mi + m 2 is the total mass. Simple algebra gives 

ri =R+^r (1.5.10) 

and 

r 2 = R-^r. (1.5.11) 

The motion of the centre of mass is determined by the equation MR = m\r\ + 
m 2 r 2 = —Gmim 2 r/r 3 + Gmim 2 r/r 3 = 0. As we had discovered in Sec. 1.4.2, the 
centre of mass moves uniformly: 

R(t) = R(0) + R(0)t. (1.5.12) 

The motion of the relative position, on the other hand, is determined by the equation 
r = f'i — r 2 = —Gm 2 r/r 3 — Grriir/r 3 , or 

r = -GM^, M = m l +m 2 . (1.5.13) 



Exercise 1.16. Verify Eqs. (1.5.10) and (1.5.11). 



The centre of mass defines the origin of an inertial frame, and the mathematical 
description of the two-body system is simplest in this reference frame. We shall 
therefore set R = 0, which brings Eqs. (1.5.10) and (1.5.11) to the simpler form 



m 2 mi 



(1.5.14) 
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The vector r{t) is determined by integrating Eq. (1.5.13). Once the solution is 
known we obtain immediately, from Eqs. (1.5.14), the vectors r\(t) and r 2 (t), which 
describe the trajectories of the two individual bodies. In Eq. (1.5.13) we have the 
reduction of our original two-body problem to a simpler effective one-body problem. 
The effective body is fictitious; it moves with a position vector r(t) in the gravita- 
tional field of another fictitious mass M = mi + m 2 situated at the centre of mass 
of the original system. 

1.5.3 Conservation of angular momentum 
From Eq. (1.5.13) we can immediately derive the statement 

7* X 7* 

r xr = -GM — — = 0. 

But d(r xf)/dt = fxf + rxf = rxf, and it follows that d{r x f)/dt = 0. The 
vector 

h = r x r (1.5.15) 

is therefore constant during the motion. This must be related to the system's 
total angular momentum which, according to the discussion of Sec. 1.4.3, is also a 
constant vector. The definition of Eq. (1.4.10) gives 

L = m A r A xri = miri x f x + m 2 r 2 x f 2 . 

A 

Substitution of Eqs. (1.5.14) gives 

TOlTOo . TO 2 m? . mi77J2(TOl + TO 2 ) 

L = r x r H ^-r x r = — r x r. 

M 2 M 2 M 2 

Simplification produces 

L=— ^— h, (1.5.16) 

and we find that, sure enough, the vector h is a rescaled version of the total angular- 
momentum vector. We shall call h the reduced angular-momentum vector of our 
two-body system. 

The position vector r(t) must be at all times orthogonal to the constant vector 
h, because r ■ h = r ■ (r x f ) =0. This simple fact has the far-reaching consequence 
that the motion must always take place within a plane that is orthogonal to the fixed 
direction of the vector h. The planar nature of the motion is illustrated in Fig. 1.9. 

Conservation of total angular momentum therefore implies planar motion. To 
describe this mathematically we orient the coordinate system so that the orbital 
plane is the x-y plane, and we direct the vector h along the z axis. We have 

r(t) - x(t)x + y(t)y, (1.5.17) 
r(t) - x(t)x + y(t)y, (1.5.18) 

and 

h = hz. (1.5.19) 
A simple calculation, based on Eqs. (1.5.15) and (1.5.17)-(1.5.19), reveals that 

h = xy — yx = constant. (1.5.20) 



Exercise 1.17. Verify Eq. (1.5.20). 
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h 



Figure 1.9: The position vector r(t) always lies in a plane orthogonal to the constant 
vector h. This plane is called the orbital plane. 

1.5.4 Polar coordinates 

To proceed with our calculations it is convenient to involve the polar coordinates r 
and (f> that were first introduced in Sec. 1.2. These, we recall, are defined by 



x = r cos i 



y = r sin < 



(1.5.21) 



In terms of the new coordinates and the associated basis of unit vectors r and </>, 
we have 



r 
v 



r r, 

r f + r(f> 0, 



* o\ ~ 1 d 1 ' 

r<p )r H (r* 

' r at 



(1.5.22) 
(1.5.23) 

(1.5.24) 



for the position, velocity, and acceleration vectors, respectively. 

If we now substitute Eq. (1.5.24) for a = r into the equations of motion of 
Eq. (1.5.13), we obtain 



12\- ld (2iM GM GM „ 



Equating the radial components of both sides gives 



• 2 GM 

r~r4>= 



(1.5.25) 



while equating the angular components gives 

d 



dt 



(r 2 (j)) = 0. 



(1.5.26) 



These are the equations of motion of the effective one-body problem, expressed in 
their simplest form in terms of polar coordinates. In the following subsections we 
will endeavour to find solutions to these equations. 
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1.5.5 Kepler's second law 

Kepler's second law states that the position vector of a planet orbiting the Sun sweeps 
out equal areas in equal times. The statement generalizes to any two massive bodies, 
and in this case the position vector refers specifically to the relative separation 
r(t) between the two bodies. This law comes as an immediate consequence of 
Eq. (1.5.26), which implies the conservation statement r 2 <f> = constant. 

Let us first show that this constant is in fact h, the magnitude of the constant 
vector h defined by Eq. (1.5.15). According to Eq. (1.5.20), this is given by h = 
xy — yx. Making use of Eqs. (1.5.21), we write this as 

h = (r cos (f>) (f sin <fi + r<j) cos <fi) — (r sin (f>) (r cos (f> — rip sin <p) . 

Simplifying this we obtain 

h = r 2 4> = constant, (1.5.27) 

the expected result. 

The fact that r 2 <p is conserved gives us our statement of the second law. Consider 
Fig. 1.10. During an interval dt of time the position vector moves by an angle d<p and 
sweeps out an area dA. To a good approximation the area is shaped as a triangle 
and we have dA = \r{r d(j>) = \r 2 d(j). The rate at which the position vector sweeps 
out this area is therefore 

^A = lr 2 <P=lh. (1.5.28) 
dt 2 Y 2 v ; 

This is a constant, and we have the mathematical statement of Kepler's second law. 

1.5.6 Conservation of energy 
With the substitution <f> = h/r 2 obtained from Eq. (1.5.27), Eq. (1.5.25) becomes 

GM h 2 



r 



= 0. (1.5.29) 



This equation can immediately be integrated by multiplying all members by r. 
(Recall that we used the same trick back in Sec. 1.3.7.) We have 
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GM . d ( GM 



-r = — - 



dt V r 
d ( h 2 



<ll\2r 2 

and the preceding equation becomes 

d (\ . 2 GM h 2 , 
dt[2 r " — + 2^, 1= °- 

This implies |r 2 — GM/r + h 2 /(2r 2 ) = e, where e is the constant of integration. 
We shall write this result in the form 

if 2 + z/(r)=e, (1.5.30) 

with 

. . GM h 2 . r ni , 

^ (r) = -_ + _. (1.5.31) 

The first term on the left of Eq. (1.5.30) can be thought of as a reduced kinetic energy 
for the radial component of the motion. The second term is a reduced effective 
potential for this motion, and the constant e is a reduced total energy. (Recall that 
we introduced this terminology back in Sec. 1.3.7; by "reduced" we mean a rescaled 
version of the usual quantities.) 

The reduced energy e is directly related to the system's true total energy E. Let 
us calculate it. The system's total kinetic energy is T = ^mi|ri| 2 + \m 2 \r 2 \ 2 , and 
according to Eqs. (1.5.4) and (1.5.7), the potential energy is V = — Gmim 2 /r. The 
total energy is therefore 

„ 1 1 . l9 1 , . l2 Gmim 2 
E=-m 1 \r 1 \ + -m 2 \r 2 \ • 

After involving Eq. (1.5.14) and cleaning up the algebra, this becomes 

m\m 2 ( 1 , . |2 GM\ 



Now Eq. (1.5.23) states r = v = rr + rcjxp, so that \r\ 2 = r 2 + r 2 <j) 2 = r 2 + h 2 /r 2 . 
So 

„ mi m 2 / La GM h 2 

E = — - — - -r 2 1 

M \2 r 2r 2 

and comparing this with Eqs. (1.5.30) and (1.5.31) yields 

E=^e. (1.5.33) 
M v ; 

As promised, e is a rescaled version of the total energy E. Recall that back in 
Sec. 1.5.3 we had similarly obtained L = (m 1 m 2 /M)/i. 



Exercise 1.18. Verify Eq. (1.5.32) and check the algebra leading to Eq. (1.5.33). 



1.5.7 Qualitative description of the orbital motion 

The equations of motion have been reduced to the first-order form of Eqs. (1.5.27) 
and (1.5.30), with the effective potential v(r) given by Eq. (1.5.31). The equation 
4> = h/r 2 informs us that (j) increases monotonically with t: If h is positive (f> is 
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Figure 1.11: Energy diagram for the radial component of the motion. When e < the 
motion takes place between two turning points; this is elliptical motion. When e > the 
motion takes place on the right of a single turning point; this is hyperbolic motion. When 
£ = the motion is parabolic. 

always greater than zero and 4>{t) is an increasing function; if h is negative <j) is 
always smaller than zero and 4>(t) is then a decreasing function. (The case h = 
will be dealt with separately later.) The equation 

1 -2 

— r + vyr) = e 

governs the radial component of the motion. As in Sec. 1.3.7 we will describe this 
qualitatively by constructing an energy diagram, a plot of the effective potential 
f(r) together with the constant value of the reduced energy s. The energy diagram 
is shown in Fig. 1.11. Recall the two main features of such diagrams: (i) The 
difference between e and v{r) represents \r 2 , the reduced kinetic energy, which 
must be positive; and (ii) points on the diagram for which v{r) — e represent turning 
points of the motion, at which f changes sign, either from positive to negative, or 
from negative to positive. 

Because the effective potential can be negative, it is possible for e to be either 
negative or positive. The nature of the motion depends sensitively on this sign. 

When e < the motion takes place between two turning points at r = r m i n 
and r = r max . The motion is bounded, and as we shall see, the orbit possesses an 
elliptical shape. When e is equal to the minimum value of the effective potential, 
£ = ^min < 0, motion can take place only at r — ro, the radius at which the 
minimum occurs, which is defined by v(r ) — v m i n . The orbit is then circular, 
because r is always zero and r can therefore never change with time. 

When e > the motion takes place only to the right of a single turning point 
at r = r m - m . The motion proceeds from r — oo (where v = and ^r 2 = e) down to 
r = r m j n (where r changes sign from negative to positive), and then back to r = oo. 
The particle traces a hyperbola in the orbital plane, and the motion is said to be 
hyperbolic. 



1.5 Kepler's problem 



33 



When e = the situation is qualitatively the same as before (for s > 0). The 
only difference is that the particle now starts at r — oo with a zero radial velocity, 
because \r 2 = e = 0. The particle then traces a parabola in the orbital plane, and 
the motion is parabolic. 

1.5.8 Circular orbits 

Circular orbits are especially simple to describe. To have circular motion we need 
both r and r to be zero, so that r always stays constant. The condition r = is 
not sufficient, because r might just happen to be in the process of changing sign at 
a turning point; we need a permanent turning point, which we get by also imposing 
r = 0. To get r — we need to impose 

, N GM h 2 

£ = " (ro) = ~ + ^T 

where ro is the orbital radius. To get r — we look back at Eq. (1.5.29) and impose 

„ GM h 2 .. . 
'0 'o 

in which a prime indicates differentiation with respect to r. The second equation 
determines h in terms of r : wc get h 2 = GMr , or 

h = y/GMr , (1.5.34) 

if we choose a positive sign for h. The first equation determines e also in terms of 
r : we get e = -GM/r + GM/(2r ), or 

e = - G ^. (1.5.35) 
The angular velocity of a circular orbit is given by <f> = h/r 2 , = ^/GMr^/rQ, or 

(1.5.36) 

The orbital period P is the time required for <p to advance by 2-k. We have 
yfGM/rl = 2tt/P, which gives 




This equation states that P 2 oc r$, and we have the statement of Kepler's third law 
for circular orbits. 



1.5.9 Shape of orbits 

To go beyond the qualitative description of the orbit we must now fully integrate 
the equations of motion. We shall, to begin with, eliminate the time from the 
equations and focus on the geometrical appearance of the orbit; we shall, in other 
words, derive a differential equation for r((j>) and solve it. We will return later with 
a description of the motion in time. 
We go back to Eq. (1.5.29), 



.. GM h 2 n 

H o 3 =0 ' 
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and to Eq. (1.5.27), 

4>=-. 

To eliminate t from these equations we write 

dr dr d<p h dr 

dt d<p dt r 2 d(j) 

We then have 

2h . dr h d 2 r ■ 

r = — 3 r ^; + ^^2^ 

r 6 d<p r 2 dqr 

dr\ 2 h 2 d 2 r 
d4>) + ^d0 2 ' 

The equation that determines the shape of the orbit is therefore 

h 2 d 2 r _2h 2 / dr\ 2 GM h 2 _ 
r 4 d(f> 2 r 5 \d(f) J r 2 r 3 

or 

dr \ 2 1 GM 

+ 



r h 2 
or 

1 d 2 r 2 fdr\ 2 1 GM 



(1.5.38) 



r 2 d(j) 2 r 3 \d(f> J r h 2 

This is a second-order, nonlinear differential equation for r (</>). 

The standard trick that is used to solve Eq. (1.5.38) is to adopt u = 1/r as the 
dependent variable. Then r = 1/u, 

dr 1 du 

d(f) u 2 def) 1 

and 

d 2 r _ 2 /du\ 2 1 d 2 u 
d(j> 2 u 3 \d4> J u 2 d(j) 2 ' 

With these transformations Eq. (1.5.38) becomes 

2fdu\ 2 d 2 u 2fdu\ 2 GM 



u\d(f> J d(j) 2 u\d(j)) h 2 
or 

d 2 u GM 1 , r n „. 

(L5 - 39) 

The equation is still of second order, but it is now linear. If we write v = u — GM/h 2 
it becomes even simpler: 

d 2 v 

d^ +V -°- 

The solution is v — u cos(</> — cf> ), where u a and (f> are the constants of integration. 
We therefore have 

u = + m cos(0 - 4> ) = [l + e cos(0 - O )] = ^ [l + e cos(0 - O )] , 
where we have put u = GMe/h 2 and p = h 2 /{GM). 
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The final result for r(<f>) is 



r U) = - -, (1.5.40) 

v ' l + ecos0 v ; 



in which we have set O = to simplify the expression. This involves two constants: 
We have p, which plays the role of average radius and is known officially as the 
semilatus rectum, and we have e, which measures the range over which r varies and 
is known as the eccentricity. We have seen that p is related to the reduced angular 
momentum h by 

h = ^jGMp. (1.5.41) 

The eccentricity, on the other hand, can be related to the reduced energy e; as we 
shall calculate in a following paragraph, 



GM 
~2p~ 



(l-e 2 ). (1.5.42) 



This equation is valid for e < 1, which means that e < 0, and it is valid also for 
e > 1, which means that e > 0. 

We have just observed that e < when e < 1. This is the case of bound 
motion, which takes place between two turning points at r = r m ; n = p/(l + e) 
and r = r max = p/(l — e). As we see from Eq. (1.5.40), the motion proceeds from 
t = fmin (known as the orbit's pericentre) when <fi = 0, to r = r max (known as the 
orbit's apocentre) when <j> = ir, and then back to r = r min when <p — 27r. When 
e < 1 the equation r = p/(l + e cos <fi) describes an ellipse. The maximum length of 
this ellipse is r m - m + r max = p/(l + e) + p/(l — e) = 2p/(l — e 2 ). Half of this is the 
ellipse's semi-major axis, 

a = i&e*' (1.5.43) 

These statements give rise to Kepler's first law: A body moving under the grav- 
itational influence of another body follows an elliptical orbit when the motion is 
bounded. When e = we have that Eq. (1.5.40) reduces to r(<p) = p, and the el- 
lipse has become a circle. In this case we have p = r , Eq. (1.5.41) becomes identical 
to Eq. (1.5.34), and Eq. (1.5.42) becomes Eq. (1.5.35). 



Exercise 1.19. Look up a reference book on elementary geometry and review the 
properties of ellipses. Answer the following questions: (1) Is Eq. (1.5.40) really the equation 
of an ellipse? (2) Where are the two foci of the ellipse? (3) What is the semi-minor axis b 
of the ellipse? A good way to answer some of these questions is to show that Eq. (1.5.40) 
is equivalent to the usual description of an ellipse via the equation X 2 /a 2 + Y 2 /b 2 = 1. 
But be careful! In this equation X is not equal to r cos <j> and Y is perhaps not equal to 
rsin^, because the two coordinate systems do not share the same origin. 



We have also observed that e > when e > 1. This is the case of unbound 
motion, which takes place to the right of a single turning point at r — r m j n = 
p/(l + e). In this case the equation r = p/(l + e cos (f>) describes a hyperbola, and we 
have hyperbolic motion. In the special case e = 1 we have s = 0, and the equation 
r =pj (1 +cos (j>) describes a parabola; in this special case we have parabolic motion. 
You may have learned that an ellipse, a hyperbola, and a parabola are all special 
cases of a general family of curves called conic sections. The conic sections are all 
described by the parametric equation r{(j>) =p/(l + ecosc/)). 

Let us now return to the derivation of Eq. (1.5.42). We go back to Eqs. (1.5.30) 
and (1.5.31), 

1 GM h 2 
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and we compute each member of the right-hand side. To evaluate f we start with 
Eq. (1.5.40) and get 

ep sin <p ; e 2 • . e 



= -r 2 d>su\6 = -/isinc 



(l + ecos(/>) 2 p 
This gives us 

1 e 2 , 2 . 2 , GM ,, /i 2 .„ ,., 
£ = --^/i sin (1 + ecos0) + — r(l + ecos0) , 

2 p 2 p 2p 2 

and replacing ft 2 by GMp in this equation yields 
GM 



e = 



2p 



e 2 sin 2 (f) - 2(1 + e cos 0) + (1 + e cos 0) 2 



After simplification the expression within the square brackets becomes e 2 sin 2 <f> — 
1 + e 2 cos 2 (p = — 1 + e 2 , and we arrive at 

e = -^T (1 - e) ' 

the same statement as in Eq. (1.5.42). 

1.5.10 Motion in time 

Now that r((j)) is known we must relate <f> to the time t in order to have a complete 
description of the motion. The relevant equations are <fi — h/r 2 , h — y/GMp, and 
r = p/(l + ecoscf)). Putting this all together, we obtain 



# GM ^ ,. 2 

^ = ^^(l + e cos0) 2 . (1.5.44) 

This is the differential equation that must be solved in order to obtain <p(t). Unless 
e is very small, in which case approximate analytical results can be obtained, this 
equation must be integrated numerically. Results of numerical integrations are 
displayed in Fig. 1.12. 

The integral form of Eq. (1.5.44) is 



<= \/tt77 / T, ^—t^ + constant. (1.5.45) 

V GM J (1 + ecoscf)) 2 v ; 

This indefinite integral cannot be evaluated in closed form, but it provides a nice 
way of calculating the orbital period P of bound orbits (e < 1). Because this is 
equal to the time required for <fi to advance by 27r, or twice the time required for cj> 
to advance by ir, we have 



P = 2 



p3 



GM J (l + ecos0) 2 ' 



This definite integral can be evaluated, and the result is 7r/(l — e 2 ) 3 / 2 . We therefore 
have 



P = 2n 



[P/(1 - e 2 )F 



GM 

We obtain a cleaner form of this result by involving Eq. (1.5.43). In terms of the 
semi-major axis a = p/(l — e 2 ), the orbital period is 



P = 2 ^GM- (L5 - 46) 
We have that P 2 oc a 3 , and this is the general statement of Kepler's third law. 
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radius 

radial velocity 




0.5 1 1.5 2 2.5 3 

time/(orbital period) 

Figure 1.12: Numerical integration of the equations of motion for an orbit with eccen- 
tricity e = 0.5. The blue curve shows the angular velocity as a function of time, the 
green curve shows the radial velocity r as a function of time, and the red curve shows the 
radial position rasa function of time. The time variable is scaled by the orbital period 
P, and three complete orbital cycles are displayed. Notice that the motion starts at the 
pericentre with maximum angular velocity and zero radial velocity. 
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1.5.11 Summary 

The motion of two bodies subjected to their mutual gravity is described by the 
relative position vector r = n — r 2 . When the origin of the coordinate system is 
at the centre of mass we have 

m 2 mi 
n = — r, r 2 = -—r, M = m, 1 +m 2 . 

The vector h = r x r is constant, and h is related to the system's total angular 
momentum by L = (mim2/M)h. The fact that h is constant implies that the 
motion takes place in a fixed plane. Using polar coordinates, the motion is described 
by the functions r(t) and 4>(t). These are determined by the first-order differential 
equations 

1. 2 _ GM + h 2 _ £ ^_h 

2 t 2r 2 r 2 

The constant e is related to the system's total energy by E = (mim2/M)e. The 
shape of the orbit is described by 

rU) = V - . 

1 + e cos (j> 

The orbital elements (p, e) are related to (h, e) by 

h=^GMp, e = - — (l-e 2 ). 

p ' 

The motion in time is determined by numerically integrating 



V ~p^~ + e cos ' 

When e < 1 the motion is elliptical, and the ellipse's semi-major axis is 

P 

a = ^ 2- 

1 — e z 

The orbital period is then 

P = 2n 



GM 



1.6 Appendix: Numerical integration of 
differential equations 

Some of the results presented in this Chapter were obtained by numerical integra- 
tion. Some of our future results also will be obtained using numerical techniques. 
In this Appendix we explain the fundamental ideas behind these numerical meth- 
ods. These ideas are implemented in various available packages, for example, within 
Maple, or within subroutines found in the book Numerical Recipes. 
To begin, we examine a first-order differential equation of the form 

£ = /(»), (1.6.1) 

where x is the independent variable, y the dependent variable, and / an arbitrary 
function of y. A concrete example is 
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which we encountered in Sec. 1.5; here x = t, y = <j>, and / stands for what appears 
on the right-hand side of the preceding equation. 

We seek to determine y(x) in the interval ^initial < x < a;g na i, starting from the 
known value f/mitiai at x = ^initial ■ The essential idea is to break down the continuum 
between ^initial and Xf\ na i into a finite number of discrete points separated by a small 
interval A. The computational grid is then 

X n = ^initial + "A, 71 = 0, 1, 2, • • • , N, (1.6.2) 

where N is the total number of points; we have A = (xfi na i — a^initiai)/^. Cor- 
respondingly, we have the sampled values y n = y(x n ) of the dependent variable, 
which we wish to determine. We shall do so by turning the differential equation 
dy/dx = f(y) into a finite- difference equation. 

Consider the first step of moving from xq = ^initial to x\. We know yo = j/mitiai, 
and we wish to determine y\. Because A is small it is safe to assume that the 
function f(y) changes by very little in the interval between y and y\. We may 
approximate it by its Taylor expansion about y = y : 

f(y) = f(y ) + f'(yo)(y-y ) + --- 

= /wfi+rvw^-tti) + •••]■ 

The differential equation gives 

dx = ^ = j^ ) [ 1 -r 1 f'(yo)(y-yo) + --]dy, 

where we have used the identity (1 + e) a = 1 + ae + 0(e 2 ), which holds for any 
small quantity e and any power a — this identity also can be established by Taylor 
expansion. Integrating the preceding equation gives 

{yi - yo) - ^f~ 1 f'{yo)(yi - yo) 2 + ■■■ , 

or 

f(y )A = ( yi - yo) - V'O/oXyi - yof + ■ ■ ■ . 
This equation can be solved formally for y\ — y : 

yi-yo - .f{yo)A + \.r 1 f'{yo){yi-yo) 2 + --- 

= f( yo )A+±r 1 f( yo )[f(y )A + ---} 2 + ■■■ 
= f(y )A + 1 -f.f(y )A 2 + ■■■. 

We write this result as 

yi = yo + f(y )A + \ff{yo)A 2 + 0(A 3 ), (1.6.3) 

indicating that the error of this approximation for y\ is of order A 3 and therefore 
quite small. 

A cruder approximation for y\ is 

Vi =yo + .f(yo)A + 0(A 2 ), 

and this approximation is at the core of Euler's method to solve the differential 
equation: From the known value yo compute f{yo) and multiply by A; add the 



xi -x Q = 



f{yo) 
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result to yo to get y\, and repeat the procedure to get 2/2, 2/3, and so on. Euler's 
method is very simple and economical, but because its error term is of order A 2 , 
it is not very accurate. With a little cleverness, however, it is possible to improve 
the accuracy of the method so that its error term becomes of order A 3 <C A 2 . One 
way of achieving this would be to use Eq. (1.6.3) instead of its cruder version. The 
price to pay would be the need to evaluate f'(y), the derivative of the function with 
respect to y. This may not be practical in some circumstances, and there is an 
alternative method. 

Consider evaluating the function / not at y — yo, but at the midpoint between 
y and y t ~ y + /(t/o)A: 

/(yo) - f(y + i/oA), 

where we use the notation f = /(yo)- By Taylor expansion we have 
/(2/o + l/oA) = /(jyo) + /'(2/o)(i/oA) + 0(A 2 ) 
= /(2/o) + ^//'(2/o)A + 0(A 2 ), 

and this shows that Eq. (1.6.3) is equivalent to 

Vi = 2/o + f(vo + 5/oA)A + 0(A 3 ). (1.6.4) 

This approximation for y\ has an error term of order A 3 , and it is obtained simply 
by evaluating the function / at the midpoint; the value of its derivative is not 
needed. 

By being increasingly clever it is possible to decrease further the size of the 
error term. The fourth-order Runge-Kutta method consists of the following recipe. 
Suppose that the differential equation has been integrated up to x — x n , and that 
we wish to proceed to the next grid point, at x = x n+1 . We have therefore obtained 
y n and we wish to calculate y n +i- First we compute the auxiliary quantities 

h = f(y n )A, 

k 2 = /(i/n + 5&l)A, 

h = f(y n + ^k 2 )A, 
h = f{y n + k 3 )A, 

and next we approximate y n +i by 

2/n+i = Vn + ;Ui + \k 2 + ifc 3 + ifc 4 + 0(A 5 ). (1.6.5) 

As indicated, the judicious choice of coefficients in front of hi, k 2 , k 3 , and fc 4 ensures 
that the error term is now of order A 5 , and therefore very small. The Runge-Kutta 
method is easy to implement, it is accurate, and it is robust; it works well for most 
functions f(y). The method can also be generalized to handle functions f(x, y) that 
depend on both variables. 

The method also generalizes to a set of differential equations 

*=/[i](l/[l],l/[2],-"). i = l,V" (1-6.6) 

for a set of dependent variables y[i]. In this case the auxiliary quantities fci, k 2 , ks, 
and fc 4 acquire an index [i]; for example we now have 



fciW =/W(2/„[l],2/»[2],---)A 
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and 

= /W(y n [l] + |fci[l],»n[2] + §*i[2], • • -)A. 

This generalization is useful, because it allows us to use the method to integrate 
second-order differential equations. Consider, for example, the pendulum equation 
of Eq. (1.3.24), 

ij + uJ 2 s\n6 = 0. 

This can be recast as a system of two first-order differential equations. To do this 
we define y[l] = 9 and y[2] — 6. When then have the system 



dt 



! sin(j/[l]). 



In this instance we find that /[l](y[l],y [2]) = y[2] and /[2](y[l], y[2]) = -lo 2 sin(y[l]). 
This system of equations can be integrated straightforwardly, and the result for 
y[l](t) is the numerical approximation to 6{t), the solution to the original second- 
order equation. 



1.7 Problems 

1. Let 

A = (3x 2 - 6yz)x + (2y + 3xz)y + (1 - Axyz 2 )z. 

Calculate J A ■ ds along the following paths that link the point (0, 0, 0) to 
the point (1, 1, 1): 

(a) The curve described by x = u, y = u 2 , and z = u 3 , in which the parameter 
u is restricted to the interval < u < 1. 

(b) The straight line that joins these points. 

2. Let 

A = (2xy + z 3 )x + (x 2 + 2y)y + (3xz 2 - 2)z. 

Find the function / such that A = V/. Then evaluate J c A ■ ds along any 
path C that links the point (1, — 1, 1) to the point (2, 1, 2). 

3. Evaluate § c r ■ ds along all closed loops C, where r = xx + yy + zz is the 
position vector. 

4. A projectile is launched with initial speed vq at an angle a with the horizontal. 
Calculate: 

(a) the position vector as a function of time; 

(b) the time required to reach the highest point; 

(c) the maximum height reached by the projectile; 

(d) the time of flight back to the Earth's surface; 

(e) the range of the projectile; 

(f) the angle a which maximizes the range. 

5. Suppose that in the preceding problem, the projectile is also subjected to a 
frictional force equal to — kv, where v is the velocity vector and k a positive 
constant. Find: 

(a) the velocity vector as a function of time; 
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(b) the position vector as a function of time; 

(c) the terminal velocity of the projectile. 

For ease of notation set k — m/r. 

6. A particle of mass m is traveling in the x direction. At time t = it is located 
at x — and has a speed vq. The particle is subjected to a frictional force 
which opposes the motion; its magnitude is equal to (3v 2 , where v — v(t) is 
the particle's speed at time t and f3 is a positive constant. 

(a) What is the speed of the particle as a function of time? 

(b) What is the position of the particle as a function of time? 

7. A particle of mass m rests on top of a sphere of radius R. The particle is then 
displaced slightly so that it starts to move down the sphere. (It is assumed 
that the particle slides down without rolling and without friction.) As it moves 
down the sphere, the particle makes an angle 9 with the vertical direction. At 
some point the particle loses contact with the surface of the sphere, and it 
proceeds to fall freely. We are interested in the motion of the particle from 
the initial moment where it is at rest to the final moment where it leaves the 
sphere. 

(a) Derive an equation of motion for 9(t), and find an expression for A, the 
magnitude of the normal force. 

(b) At which angle 9 does the particle leave the surface of the sphere? 

(c) What is the speed of the particle when it leaves the surface of the sphere? 

[Hint: This problem is involved. You may find it useful to resolve the force 
and acceleration vectors into a basis that consists of f, a unit vector that 
points in the direction normal to the sphere, and 0, a unit vector that points 
in the direction of increasing 9.] 

8. The planar pendulum of Sec. 1.2.7 is now subjected to a frictional force 
-Pfriction = — (to/t)v, where t is a positive constant. Derive the new equa- 
tion of motion for the swing angle 9. 

9. The equation of motion of the preceding problem reduces to 

9 + 2 7 (9 + lu 2 9 = 

when the oscillations have a very small amplitude; here 7 is a positive constant 
that is related to r in the preceding problem. Find the general solution to this 
equation. Assume that u 2 > j 2 , so that the oscillations are undcrdamped. 
[Be sure that your final expression for 9{t) is a real (not complex) function.] 

10. A mass m is allowed to move along the x axis, either in the positive or in the 
negative direction. It is subjected to a constant force +F when x < and to 
a constant force — F when x > (here F is positive). 

(a) Describe the motion qualitatively with the help of an energy diagram. 

(b) Calculate the period of the motion; express your result in terms of m, F, 
and the amplitude A of the motion. 

11. A cylindrical cork is partially immersed in a liquid of (mass) density p. The 
cork's axis is oriented with the vertical direction, and the cork floats in the 
liquid. The cylinder's cross-sectional area is A, and its mass is m. The 
cork is gently pushed down into the liquid and then released; it starts to 
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oscillate. Neglecting any damping effect, calculate the angular frequency ui of 
the oscillations. 

[Hint: The restoring force is provided by the buoyancy of the liquid. According 
to the Archimedes principle, the buoyant force is equal to the weight of the 
liquid displaced by the cork.] 

12. For Kepler's problem, prove that the square of the velocity vector v — r can 
be expressed as 

v 2 = GM{ 



(r a)' 



where a — p/(l — e 2 ) is the ellipse's semi-major axis. What is v at pericentre? 
What is v at apocentre? (Express your results in units of the average speed 
v = y/GM/a.) 

13. We have seen that the description of Keplerian motion in time can be ob- 
tained by integrating Eq. (1.4.52). An alternative method is based on the 
representation 

r = a(l — ecosip), 

in which r is expressed in terms of the eccentric anomaly ip; this is an angular 
parameter that ranges from to 27r as each body completes one full orbit. We 
wish to find ip as a function of time. 

(a) Starting from the statement of energy conservation, \r 2 + u{r) = e, in 
which you are to substitute h 2 = GMa{\ — e 2 ) and e = —GM/(2a), 
derive an expression for ip. Make sure that this expression is simplified 
to the full extent possible. 

(b) Integrate the equation for ip that was obtained in part (a). Show that 
the solution is 

• , \GM , \ 
ip - e sin ip = W — j- (t-t ), 

where t is the time at which ip = 0. This is Kepler's equation, and 
it can be numerically inverted to yield ip(t). This method is the most 
convenient to find the behaviour of r as a function of time. 

14. We examine a special case of Kepler's problem. We set h = 0, so that <p = 0. 
We have purely radial motion, and the equation for r(t) reduces to 

1 2 GM 

2 r 

where e is the reduced total energy. 

(a) Construct an energy diagram for this situation. Describe the motion 
qualitatively when e > 0, when e = 0, and when e < 0. 

In the rest of the problem we consider the subcase e < in some detail. 

(b) Relate e to r max , the maximum value of r at which a turning point occurs. 

(c) Imagine that the motion proceeds from r = r max to r = 0. We represent 
this mathematically in terms of an auxiliary variable, the angle 77. We 
write 

r (v) = 2 r max(l + COS 77), 

and we let 77 vary from 77 = to i] = it. Calculate 77 = drj/dt and solve 
this for t(ri). The motion is now completely determined. Provide a plot 
of r as a function of t. 
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(d) What is the total time required for the bodies to go from r = r max to 
r = 0? 

15. Two particles move about each other in circular orbits under the influence 
of their gravitational attraction; their orbital period is r. Their motion is 
suddenly stopped at a given instant of time, and they are then released and 
allowed to fall into each other. Calculate the time required for them to collide; 
express your answer in terms of r. 

[Hint: You will need the result of part (d) of the preceding problem.] 

16. The escape velocity of a particle on Earth is the minimum velocity required on 
the Earth's surface for the particle to escape the Earth's gravitational field. 
Neglecting air resistance within the atmosphere, calculate this in terms of the 
Earth's mass M and its radius R. Evaluate this numerically and show that it 
is close to 11 km/s. 

17. Two bodies of mass mi and m 2 are subjected to their mutual forces, so that 
the force acting on body 1 due to body 2 is F\ 2 — —f{r)r, while the force 
acting on body 2 due to body 1 is F 21 = +f(r)f. Here / is an arbitrary 
function of r = |ri — r 2 \, the distance between the bodies, and the forces 
are directed along r = r/r, where r = r x — r 2 ; such forces are called central 
forces. 

(a) Derive an equation of motion for R, the position of the centre of mass. 

(b) Derive an equation of motion for r, the position of body 1 relative to 
body 2. 

(c) Prove that h = r x r is a constant vector; conclude that the motion takes 
place in a fixed plane. 

(d) Introduce the polar coordinates (r, <f>) and prove that \h\ = h = r 2 <p; 
conclude that Kepler's law — the law of areas — is valid for all central 
forces, and not just for gravity. 

(e) Show that the equation of motion for r reduces to 

r+^- — =0, 
[i r A 

where /j = to 1 to 2 /(™i + m 2) is known as the reduced mass of the two- 
body system. 

(f) Prove that the shape of the orbit is determined by 

d 2 u f 
d(j) 2 fih 2 u 2 ' 

where u = 1/r. 

18. A central force / = k/r n , where k is a constant and n an integer, is known to 
produce an orbit described by r — ae~*, where a is a constant. 

(a) Plot this orbit in the x-y plane. 

(b) Determine the integer n. 

19. A central force / = k/r n , where k is a constant and n an integer, is known to 
produce an orbit described by r = a^cos 20, where a is a constant. 

(a) Plot this orbit in the x-y plane. 
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(b) Determine the integer n. 
20. A two-body system moves under the influence of a central force given by 




where a and b are constants. 

(a) Show that the shape of the orbit is described by 

r = - 

1 + ecos(k(j)) ' 

where p, e, and k are constants. Express p and k in terms of a, b, h 2 , 
and ii. (Assume that b < fih 2 .) 

(b) Plot the orbit in the x-y plane. Set p = 1, e = 0.6, k = 0.99, and let <f> 
range from to 16n. What is happening to the major axis of the ellipse? 

1.8 Additional problems 

1. An inclined plane makes an angle a with the horizontal. A projectile is 
launched from point A at the bottom of the inclined plane. Its initial speed 
is wo, and its initial velocity vector makes an angle (3 with the horizontal. 
The projectile eventually hits the inclined plane at point B. Air resistance is 
negligible. 

(a) Calculate the range R of the projectile, the distance between points A 
and B. Show that it it can be expressed in the form 

R = i?o sin(/3 — a) cos (3 

and find an expression for R . 

(b) Find the angle /3 max which maximizes the range. 

2. A particle traveling in the positive x direction is subjected to a force F = kx 3 . 
The particle started from an initial position xq < 0. Draw an energy diagram 
for this situation and provide a qualitative description of the possible motions. 

3. Two bodies of masses mi and m 2 are subjected to a mutual attractive force 
F\2 = — /cTOim 2 r, where k is a constant and r = r\ — r 2 is the relative position 
vector. 

(a) Show that the equation of motion for r(t) can be put in the form of an 
energy equation, 

-f 2 + v{r) = e, 

and find an expression for z^(r), the effective potential. Draw an energy 
diagram for this system and give a qualitative description of the possible 
motions. 

(b) Prove that 

r (4>) = , 

describes the shape of the orbit, and solve for r in terms of the constants 
e, M — nil + m2, h, and k. 
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4. The parabolic coordinates u and v are sometimes useful to describe the motion 
of a particle in a two-dimensional plane. These are related to the Cartesian 
coordinates x and y by 

1 / 2 2\ 

.x = uv, y — - (u — v ). 

(a) Sketch the shapes of the curves u = constant in the x-y plane. 

(b) Sketch the shapes of the curves v = constant in the x-y plane. 

(c) Find the unit vectors u and v associated with this coordinate system. 



Chapter 2 
Lagrangian mechanics 



2.1 Introduction: From Newton to Lagrange 

The methods of Newtonian mechanics, based on the vectorial equation F = ma, are 
very powerful and they can be applied to all mechanical systems. But they lack in 
efficiency when Cartesian coordinates (x, y, z) do not give the simplest description 
of a mechanical system. An example is the problem of the pendulum (Sec. 1.3.7), 
which is best analyzed in terms of the swing angle 0; we have seen that to derive 
the equation of motion for 9{t) requires somewhat laborious calculations, and the 
reason is precisely that 9 is not a Cartesian coordinate. Another example is Kepler's 
problem (Sec. 1.5), which is best analyzed in terms of the polar coordinates (r, </>); 
again we saw (back in Sec. 1.5.4) that to derive equations of motion for r(t) and 
4>{t) required some long calculations. 

To increase the efficiency of the theoretical methods of mechanics, a number of 
scientists in the centuries following Newton endeavoured to recast the Newtonian 
laws into a more flexible formulation. The most famous players include Leonhard 
Euler (1707-1783), Joseph Lagrange (1736-1813), William Rowan Hamilton (1805- 
1865), and Carl Gustav Jacobi (1804-1851). Their new techniques proved extremely 
useful, and they allowed them and others to solve increasingly challenging problems, 
most notably in the context of celestial mechanics. These new powerful techniques 
are the topic of this chapter on Lagrangian mechanics, and the following chapter 
on Hamiltonian mechanics. 

It is important to point out that the Lagrangian and Hamiltonian formulations 
of the laws of mechanics are largely restricted to forces that can be derived from 
a potential. For other problems, such as a particle subjected to air resistance, the 
new techniques cannot be applied in a very straightforward way, and it is usually 
best to go back to the old Newtonian methods. In this chapter and the next, we 
shall consider only forces that can be derived from a potential. 

The entire content of Lagrangian mechanics is summarized in the following sim- 
ple recipe: 

1. Select generalized coordinates q a to describe the degrees of freedom of a me- 
chanical system. These coordinates are completely arbitrary. They need not 
be the original Cartesian coordinates associated with an inertial frame. In- 
deed, there is no need for the coordinates to even be attached to an inertial 
frame. The index a — 1, 2, ■ ■ ■ labels each one of the generalized coordinates; 
there is one coordinate for each degree of freedom. 

2. In terms of the generalized coordinates, calculate the system's total kinetic 
energy T and total potential energy V. Then form what is known as the 
Lagrangian function of the system, which is denoted L(q a ,q a ); this depends 
on the generalized coordinates q a and the generalized velocities q a = dq a /dt. 
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The Lagrangian is denned by 

L = T-V; 

it is the difference between the kinetic and potential energies. 

3. Substitute the Lagrangian into the Euler- Lagrange (EL) equations, 

d dL dL _ Q 
dt dq a dq a 

This returns an equation of motion for each generalized coordinate q a (t). 
There is one EL equation for each generalized coordinate. 

4. The rest of the recipe is concerned with solving the equations of motion. The 
methods for doing this are varied, and they depend on the particular situation, 
just as they do in the Newtonian formulation. 

Let us first verify that the recipe is compatible with Newton's laws. Consider a 
particle moving in three-dimensional space and subjected to a potential V(x,y,z). 
As indicated, we use Cartesian coordinates to describe the motion of the particle. 
In this case, therefore, the generalized coordinates are chosen as q\ = x, qi = y, and 
g 3 = z. The particle's kinetic energy is T = \m{x 2 + y 2 + i 2 ), and the Lagrangian 
function is 

L(x, y, z, x, y, z) = -m(x 2 + y 2 + z 2 ) - V(x, y, z). 

To substitute this into the EL equation for q\ = x, say, we must first evaluate 
dL/dx. This is the derivative of L with respect to x, treating all other variables 
(including x) as constant parameters. This is given by 

dL 

— = mx. 
ox 

We next differentiate this with respect to t, and get 

d^dL _ .. 
dt dx 

Finally, we differentiate L with respect to x, treating all other variables (including 
x) as constant parameters; this gives 

dL _ _dV 
dx dx 

Substituting these results into the EL equation for x, we arrive at 

•• dV n 
mx + — — = 0. 
ox 

Repeating these calculations for y and z would eventually return the full vectorial 
equation 

ma + W = 0, 

or ma = F if we recall that the force is derived from the potential, so that F — 
— W. This exercise reveals that indeed, the Lagrangian recipe is compatible with 
the Newtonian law. 

The true power of the recipe, however, is revealed when the generalized coor- 
dinates are not Cartesian. Let us see what the recipe produces in the case of the 
pendulum. Recall from Sec. 1.3.7 that the pendulum's single degree of freedom 
is best represented by the swing angle 9; this will be our generalized coordinate 
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for this problem, and we write q = 9. (We do not need a label a in this case, as 
there is only one generalized coordinate.) The relation between 9 and the original 
Cartesian coordinates is x = ^sin# and z = £cos9, with I denoting the length of 
the rod. The pendulum's kinetic energy is T = \{x 2 + i 2 ) = ^m£ 2 9 . Its potential 
energy is V — —mgz = —mglcosG = — ml 2 uo 2 cos 8, where we have reintroduced 
the quantity u 2 = g/i. The pendulum's Lagrangian function is 

L(6,6) = m£ 2 ( \e 2 +uj 2 cos9 



To substitute this into the EL equation we must first evaluate dL/dO, the partial 
derivative of L with respect to 9. This is 

= m£ 2 9. 

89 

Next we differentiate this with respect to time, and obtain 

= mtO. 

dt de 

Finally we calculate the partial derivative of L with respect to 9, which yields 

— - = —ml uj smO. 
o9 

Substituting these results into the EL equation produces 

m£ 2 (9 + lo 2 sin 6) = 0, 

the same pendulum equation as in Eq. (1.3.24). Comparing the computations car- 
ried out here to those required in Sec. 1.3.7, the greater efficiency of the Lagrangian 
recipe should come out loud and clear. 

It is possible to derive the Lagrangian recipe from Newton's law, F = ma. The 
derivation is fairly laborious, and it involves performing a transformation from the 
original Cartesian system (x,y, z) to the generalized coordinates q a . It is possible, 
however, and more interesting, to derive the recipe from a new physical principle. 
Instead of postulating the validity of F = ma as the starting point of Newtonian 
mechanics, we shall instead adopt the principle of least action as the starting point 
of Lagrangian mechanics. As we shall see in the next two sections, the Euler- 
Lagrange equations can be derived as a direct consequence of the principle of least 
action, and as we have already seen, these are fully compatible with Newton's law. 
What we have, therefore, is the Newtonian postulate arising as a consequence of 
the new principle. More importantly, we have the more flexible framework of the 
EL equations arising as a consequence of the principle of least action. 

As we shall see below, the principle of least action states that of all the possi- 
ble paths q a (t) that a mechanical system could take to go from configuration 1 to 
configuration 2, the paths that are actually taken are the ones which minimize the 
system's action functional, defined by 

S[q a (t)} = f L{q a ,q a )dt. 



This beautiful statement is mathematically equivalent to the full set of EL equa- 
tions, which give rise to the equations of motion that determine the actual paths 
of the system. This formulation of the laws of mechanics, in terms of a least-action 
principle, is economical and conceptually compelling. It is also extremely powerful: 
Virtually all fundamental laws of physics (including field theories) can be formulated 
in terms of such an action principle. 
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Figure 2.1: A curve in the x-y plane that links the points (—1,0) and (1,0). 



2.2 Calculus of variations 



In this section we introduce the mathematical tools — the calculus of variations 
— that are required in the derivation of the Euler-Lagrange (EL) equations from 
the principle of least action. We will look at this issue from a purely mathematical 
point of view, and return to the physics in the next section. 



2. 2. 1 Curve of maximum area 

Let us examine the following mathematical problem. We consider the infinite num- 
ber of curves in the x-y plane that link the point (x = — l,y = 0) to the point 
(x = +l,y = 0); see Fig. 2.1. Of all these curves we select those that have a total 
arc length (the total distance traveled along the curve) equal to w. Of all the curves 
that are left we wish to find the one which maximizes the area under the curve. 
(Notice that the mathematical problem involves maximization of an area, while the 
physical problem involves minimization of an action. The mathematical techniques 
to be developed below work for both cases, maximization and minimization, and 
they do not care about the identity of the quantity to be extremized.) 

We describe the family of curves introduced in the previous paragraph by para- 
metric relations x(s) and y(s), in which the parameter s is the curve's arc length, 
calculated from the starting point (—1,0). Because all the curves within the family 
have a total arc length of 7T, the parameter s ranges from to 7r as each curve runs 
from (—1,0) to (+1,0). We have ds 2 = dx 2 + dy 2 , and this relation implies that 
the functions x(s) and y(s) are not independent of each other. The area under the 
curve is obtained by integration, A — J y dx, which we write as 



A= j y(s)^-ds. 
Jo ds 



We can replace the factor dx/ds by y/l — y' 2 , where y' = dy/ds. This gives us, 
finally, 



A= yy/l - y' 2 ds. (2.2.1) 



We wish to find the function y(s) that produces the largest possible value for A. 
Once this function is identified, x(s) can be obtained by integrating the equation 

x' = ^l-y' 2 - (2.2.2) 



The maximal curve is then fully determined. 
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fix) 




X 



X 



Figure 2.2: A function with a maximum point at x = x. Because this is an extremum 
point, a displacement around x produces the smallest change in the function. 



To proceed it is helpful to broaden the scope of the preceding discussion and to 
examine the general structure of the mathematical problem. We are given a func- 
tional A[y], a function A of a function y(s), which we wish to maximize, or perhaps 
minimize, with respect to the choice of path y(s). (In general we say that we wish to 
find the extremum of the functional, and we shall never need to distinguish between 
a maximum and a minimum.) The functional has the following structure: 



it is given by an integral over a parameter s of a function G which depends on the 
path y(s) and its derivative y'{s) — dy/ds. The integral can be evaluated for any 
choice of trial function ytnai{s), and the result is a number Atrial- We are looking for 
the function y(s) that produces the largest (or smallest) number. In mathematical 
terms, we are looking for the extremum of the functional A[y\. 

The mathematical task of extremizing a function fix) with respect to its argu- 
ment x — the argument being a number — is a simple one: We simply calculate 
the derivative of the function and set the result equal to zero; the solutions to 
df /dx = are all extremum points (minima and maxima) of the function. To ex- 
tremize a functional A[y] with respect to a functional argument y(s) is a much more 
delicate task. How does one do this? 

Let us examine more closely the straightforward task of finding an extremum 
of a function fix). We imagine, for concreteness, that the function has a single 
maximum at x = x; this is represented in Fig. 2.2. We have, of course, fix) = 0, 
with a prime indicating differentiation with respect to x. 

An important property of x is that it is the point from which the function 
fix) changes the least when x is displaced from x to a neighbouring point x + dx. 
That this is so can easily be seen from the figure, but it is just as easy to prove it 
mathematically. Let us calculate df, the change induced in the function when its 
argument x is moved to a neighbouring point x + 5x. By Taylor's theorem we have 



2.2.2 Extremum of a functional 




(2.2.3) 



Sf 



f{x + Sx) - fix) 
f'{ x )8x+ l -f"{x){8xf + ---. 
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y(s) 




yi 



y 



s 



si 



s 



Figure 2.3: A family of paths which leave y — yo when s — so and arrive at y — y\ 
when a = s\. 

From this calculation we learn that in general, the change in the function is propor- 
tional to 8x, as we might have expected. But when we let x become an extremum 
point x, we get a different result. In this case we have f'(x) = and the preceding 
equation becomes 



Now the change in the function is proportional to (8x) 2 , and this is much smaller 
than what we get in the general case. We have just found that the variation 8f is 
smallest when it is taken at an extremum point. A useful way of characterizing an 
extremum point is therefore to say that it is a point from which a displacement Sx 
produces a vanishing change to linear order in Sx. (The change is not actually 
zero, but it is of second order in Sx, as we have shown.) 

We shall use the same idea to find the extremum path of a functional. We will 
look for a path y(s) — analogous to the extremum point x — that has the property 
that a displacement away from this path produces no change in the functional 
A[y], to linear order in the displacement Sy(s). In other words, if we evaluate the 
function on the extremum path y(s) and get the number A, we will find that if we 
then evaluate the functional on the displaced path y(s) = y(s) + 5y(s), we will still 
get the number A, except for a correction of second order in the displacement; the 
change 8 A is zero to first order in Sy(s). 

To flesh this out let us consider all paths y(s) that leave the point y — yo when 
s = so and arrive at the point y = y\ when s = s\\ members of this family of curves 
are displayed in Fig. 1.3. Out of all these possible paths that link y and y% we wish 
to find the one which extremizes the functional A[y\. Our strategy will be to assume 
the existence of an extremum path, which we denote y(s), and which we treat as a 
reference path. We shall examine what happens to A[y] when we displace the path 
from y(s) = y(s) to y(s) = y(s) + Sy(s). While we shall find that in general, this 
produces a change 8 A that is proportional to Sy(s), we will instead demand that 8 A 
vanish to first order in the displacement; as we shall see, this procedure will permit 
us to identify the extremum path y{s). To carry out this procedure properly it is 
important to ensure that all the considered paths begin and end at the same two 
end points. The reference path y(s) and the displaced paths y(s) — y(s) + 8y(s) 
must all satisfy y(so) = yo and y(si) = y\. This implies that the displacement 
Sy(s), which are completely arbitrary in the interval Sq < s < s l7 must satisfy the 



8f= 2 f"(x)(8xf + ---. 
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Figure 2.4: The reference path y(s) (in blue) and a displaced path y(s) = y(s) + Sy(s) 
(in red). The displacement is arbitrary away from the two end points, but it must vanish 
at the end points. 



boundary conditions 



8y(s ) = = Syisx). 



(2.2.4) 



The situation is illustrated in Fig. 2.4. 

We evaluate first the functional A[y] on the reference path y(s); this is 

A = A[y}= G(y,y')ds. 

J so 

We next evaluate the functional on a displaced path y(s) = y(s) + Sy(s); this is 
A[y + Sy}= [ G{y + 5y,y' + Sy') ds, 



where Sy' = y' — y' — d(y — y)/ds = d{5y)/ds. The change in the functional is 
5 A = A[y + Sy]-A[y] 



G(y + Sy,y' + 5y')~G(y,y') 



ds. 



and we wish to find conditions on y(s) that will allow us to set SA = 0, up to 
corrections of second order in Sy. 

The function G depends on two variables, y(s) and y'(s). By Taylor's theorem 
we have 



G(y + Sy,y' + Sy')=G(y,y') + 



dG 
dy 



Sy 



v=y,v =y 



dG 
dy' 



Sy' + ---, 



v=v,v =y 



where we omit terms of higher order than first in the displacements 8y(s) and Sy'(s) 
The change in functional is therefore 



SA 



dG . dG . / 
dy dy' 



ds, 



where we again neglect higher-order terms, and where we discard the signs \ y= y >y i—yi 
that instruct us to evaluate the partial derivatives on the reference path y(s); this 
operation will henceforth be understood. 
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Recalling that Sy' 
gral: 



d{5y)/ds, we manipulate the second term within the inte- 



dG 
dy 



7 Sy' ds 



dG 



7 d(Sy) 



dy 



\dy> 



-Syj -5yd 



/dG 



Sy-—ds 
ds dy' 



This term can be integrated by parts, and we obtain 



5A- 



dG 

dy' 



-,5y 



+ 



f 

J Sn 



dG _ d dG 
dy ds dy' 



Sy(s) ds. 



This result simplifies by virtue of Eq. (2.2.4): Because the displacement Sy must 
vanish at the two end points, the boundary terms are necessarily zero. We end up 
with 

f Sl \dG d dG] . . . , 

= L [^-dsw\ s ^ ds - 



SA 



(2.2.5) 



The functional A[y] will be an extremum if SA vanishes for all displacements Sy(s) 
that satisfy the boundary conditions of Eq. (2.2.4). As we shall show presently, 
this will happen if and only if the quantity within square brackets vanishes. We 
therefore have the statement 



SA = 



d^dG _ dG 
ds dy' dy 



= 0. 



(2.2.6) 



This is the Euler-Lagrange (EL) equation associated with the function G(y, y') 
which defines the functional A[y]. When fully worked out, the EL equation takes 
the form of a second-order differential equation for the function y(s). Solving this 
equation gives the extremum path y(s). 

To justify Eq. (2.2.6) we consider any integral of the form 



E(s)n(s) ds, 



which is known to vanish for any choice of function n(s). [Here E(s) plays the 
role of the quantity within square brackets in Eq. (2.2.5), and n(s) plays the role of 
Sy(s).] What does this tell us about E(s)l To answer this let us design the arbitrary 
function n(s) to suit our purposes. Let us imagine that it is everywhere positive 
and very sharply peaked near some value of s between so and s±, say s = s*. Under 
these conditions the integral can be approximated by 



E(s 



*) f 

J Sn 



n(s) ds, 



and since the integral cannot be zero, we must conclude that E(s*) = 0. Because 
the value of s* is arbitrary, we can safely conclude that E(s) must vanish everywhere 
in the interval so < s < s i- ln this way we have shown that Eq. (2.2.5) leads to 
Eq. (2.2.6) whenever the displacement Sy(s) is arbitrary. 

To sum up, we have shown in this subsection that an extremum path of the 
functional 



A[y] 



f 

J Sn 



G(y,y')ds 
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is obtained by finding a solution y(s) to the EL equation 

d^dG _dG_ 
ds dy' dy 

This statement is true whether the extremum is a maximum or a minimum, and it 
is independent of the detailed nature of the function G{y,y'). Any function of the 
two variables y(s) and y'(s) can thus be substituted inside the functional, and our 
calculus of variations applies to a very wide range of situations. 

2.2.3 Curve of maximum area (continued) 
The function G that corresponds to our original problem is 

G(y,y') = yy/l-y' 2 . (2.2.7) 

Substitution of this function into the EL equation will produce a second-order dif- 
ferential equation for y(s). Solving this will give us the curve that maximizes the 
area. 

When we substitute Eq. (2.2.7) into Eq. (2.2.6) we must first calculate the 
derivative of G with respect to y' , treating y as a constant parameter. This is 

We next differentiate this with respect to s. Because dG/dy' = G y < depends on s 
through its dependence on both y and y', we must apply the chain rule. This gives 

d_dG dGy^dy dG y , dy 1 

ds dy' dy ds dy' ds 



dy dy' 



We have 



and 



so that 



d -lf = -y'[i-yT 1/2 

dG v' _ „.ri „/2i-3/ 2 



dy' 



-y[i-y n 



ddG - - y ' 2 [i-y'T 1/2 -yy"[i-y'T 3/2 - 



ds dy' 

The remaining quantity to calculate is 

After cleaning up the algebra we find that the EL equation is 

yy" -y' 2 + 1 = 0. (2.2.8) 
This is a nonlinear, second-order differential equation for the function y(s). 

Exercise 2.1. Make sure that you can reproduce the computations that lead to 
Eq. (2.2.8). 

The general solution to Eq. (2.2.8) is 

1 / 
y = — smci(s + c 2 ), 

Cl 
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where c\ and c 2 are two constants. That this is indeed a solution can be verified 
by direct substitution; that this is the general solution can be seen from the fact 
that it depends on two arbitrary constants, the correct number for a second-order 
differential equation. These constants are determined by enforcing the boundary 
conditions y(s = 0) = and y(s = tt) = 0, which follow from the requirement that 
the maximum curve must link the points (—1,0) and (+1,0). The first condition 
gives (1/ci) sin(cic 2 ) = 0, which implies that c 2 = 0. The second condition gives 
(1/ci) sin(ci7r) = 0, which implies that C\ must be an integer, which we call n. We 
therefore have 

y(s) — — sinns, y (s) — cosns. 
n 

We may now look for x(s), which is determined by Eq. (2.2.2), 

x' = \J\ — y' 2 = \J\ — cos 2 ns = sin ns. 

This integrates to a; = xq — (1/n) cos ns, where x n is another constant of integration. 
We must now impose the boundary conditions x(s = 0) = —1 and x(s = tt) = +1. 
The first condition gives x — 1/n = —1, so that x = —1 + l/n. The second 
condition gives — 1 + (1 — cosmr)/n = 1, or cosn7r = 1 — 2n, which implies that 
n = 1. We therefore have x = — coss, and the constraint n = 1 also implies 
y = sin s. 



Exercise 2.2. Verify that y = (1/ci) sinci(s + C2) is a solution to Eq. (2.2.8), and 
verify that the choices ci = 1 and C2 = are appropriate given the boundary conditions. 



Our final result is this: The curve that maximizes the area A is described by 
the parametric relations 

x(s) = — coss, y(s) = sins, < s < tt. (2.2.9) 

This is a half-circle of unit radius that links the points (—1,0) and (+1,0). The 
maximum area is then given by 

f* dx 7T 

Ana X = j Vi 3 )-^ ds = j sin 2 sds = - ~ 1.5708. 

To test whether this is really a maximum we evaluate A for a different choice of 
curve, one which consists of two straight segments. The first segment connects the 
points (—1,0) and (0,yo), while the second segment connects the points (0,yo) and 
(1,0). The length of each segment is I = \J\ + y 2 . Because the total length of 
the curve must be equal to tt, we must set yo = \J (tt/2) 2 — 1. The area under this 
curve is the area of a triangle of base 2 and height y , so 

A = \{2){y ) = v/(tt/2) 2 - 1 ~ 1.2114. 

This area is indeed smaller than A max . 

2.2.4 Path of minimum length 

The calculus of variations, introduced in Sec. 2.2.2, can be employed to solve many 
different problems involving either the maximization or minimization of a functional. 
A simple example is the problem of finding the curve y(x) that minimizes the 
distance between two fixed points in the x-y plane. We already know that the 
answer is a straight line, but it will be comforting to use the calculus to give a 
mathematical proof of this statement. 
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We shall take the two points to be (0,0) and (xi,yi), respectively. We want to 
calculate the distance s measured along the curve y(x), and we want to find the 
path y(x) that minimizes this distance. The increment of distance ds along the 
curve is easy enough to calculate; it is given by 

ds = \J dx 2 + dy 2 = \J\ + (dy/dx) 2 dx = \Jl+y' 2 dx, 

where we have set y' — dy/dx. The total distance along the curve is obtained by 
integration. We have 

f-Xi 



f 1 y/1 + y' 2 dx. (2.2.10) 
Jo 



This is a functional of the path y(x), and we wish to minimize this functional. So 
here s plays the role of A[y), and x plays the role of the old parameter s. The 
function G is given by 

G(y,y') = Vl + y' 2 - (2.2.11) 

Notice that this depends only on y'; there is no explicit dependence on y. 
The EL equation for this situation is 

d dG _dG _ 
dx dy' dy 

Because G does not depend explicitly on y we have that dG/dy — 0. The EL 
equation implies 

±d_G =Q ^ 
dx dy' 

and this states that the quantity dG/dy' is in fact a constant, independent of x. 
We shall call this constant c. Calculating dG/dy' gives y' /V^ + y' 2 , and we have 
obtained the statement 



This equation can easily be solved for y', and we get 

c 



y' 



where m is a new constant. Integration of this equation is straightforward, and we 
obtain 

y(x) = mx + b, 

where b is a final constant of integration. This is the equation of the straight line, 
the result we expected. 

The constants m and b can be determined from the boundary conditions, y(x = 
0) = and y(x = x±) — y\. The first condition implies 6 = 0, while the second 
condition implies m = y\/x\. The final result is therefore that the path which 
minimizes the distance between (0,0) and (x\,yi) is described by 

y(x) = — x. (2.2.12) 
x\ 



That this is indeed a minimum, instead of a maximum, is obvious from the fact 
that the maximum distance between two points is always infinite. 
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2.2.5 Brachistochrone 

In this application of the calculus of variations we consider a particle released from 
rest on a slide of a specified shape. The particle is subjected to gravity, and it 
moves on the slide without friction. It eventually reaches the point (x\,zi) in a 
time t, as illustrated in Fig. 2.5. We wish to determine the shape of the slide that 
minimizes this time. This classic problem of mathematical physics is called the 
brachistochrone; it was first solved by Johann Bernoulli in 1696. 

The shape of the slide is specified by the unknown function x(z); this curve in 
the x-z plane is required to link the points (0,0) and (x\,z\). The increment of 
length on the curve is given by 

ds = \/ dx 2 + dz 2 = \f\ + (dx/dz) 2 dz = \/l + x' 2 dz, 

where we now use x' to denote dx/dz. The speed of the particle on the slide is 
v = ds/dt, and the increment of time is given by dt — ds/v. The total time 
required by the particle to reach the point (x\, z\) is then 



, ds f Zl Vl + x' 2 , 
t = / — = / ^- — dz. 



v(z) 

To calculate v(z) we appeal to the conservation of mechanical energy. In this 
situation the particle moves under the action of gravity, and its total energy is 
E = hmv 2 — mgz. It is stated that the particle proceeds from rest (v = 0) at the 
upper point of the slide (z = 0), and we conclude from this that its total energy is 

b 



zero. As a consequence we find that \mv 2 = mgz, or v(z) — \/2gz. We therefore 
have 



t - — f 1 ^ l + — dz 

V^g Jo Vz 



and the functional that we wish to minimize is 



/I T T /2 

2gt[x]= I r dz. (2.2.13) 



Here the role of the parameter is played by z, and the function G is given by 

G(x,x') = V t. . (2.2.14) 
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Notice that this depends only on x'; there is no explicit dependence on x. Notice 
further that there is an explicit dependence on the parameter z. 
The EL equation for this situation is 

d_dG_ _dG__ ,.. 
dz dx' dx 

We have dG/dx = 0, and we conclude immediately that 

dG 1 
constant 



dx' y/2a' 
(It turns out to be convenient to make this choice of constant.) Calculation gives 

dG _ x' 

and the EL equation reduces to 



zV± + x' 2 V2a 
This can easily be solved for x', and we obtain 



x' = 



\j2az — z 2 

This equation, finally, can be integrated, and a formal solution to our problem is 

x{z)= f -pL= (2.2.15) 
Jo V laz — £ z 

It is this integral that determines the shape of the minimal slide. 

Exercise 2.3. Make sure that you can reproduce the steps that lead to Eq. (2.2.15). 

To evaluate the integral of Eq. (2.2.15) we change the variable of integration 
from z to 9 using the transformation 

z = a(l — cos 9), 

which implies dz = a sin 9 d9. The angle 9 runs from when z = to 9\ when 
z = z\. After a short calculation we find that laz — z 2 = a 2 sin 2 9, and it follows 
that 

x = a f (1 - cos 6)d6 = a{6- sin 9). 
Jo 

The shape of the slide is therefore described by the parametric equations 

x(9) = a(9- sin 9), z{9) = a(l - cos 9) 0<9<9 1 . (2.2.16) 

These describe a curve known as a cycloid. The constants a and 6\ are determined 
by the condition that x = X\ and z = Z\ when 9 = 9\. For example, if we choose 
Xi = 5 and z\ — 1, then we need a ~ 0.89483 and 9\ ~ 4.5946. This particular slide 
is shown in Fig. 2.6. The figure reveals that contrary to expectations, the slide does 
not always go down; it indeed turns around when 9 = tt ~ 3.1416. 



Exercise 2.4. Make sure that you can reproduce the steps that lead to Eq. (2.2.16). 
Check that the constants a ~ 0.89483 and 6i ~ 4.5946 do indeed produce x\ = 5 and 
z\ — 1. Can you devise a method to determine a and 9i given a choice for xi and 21? 
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1 2 3 4 5 6 



x 

Figure 2.6: A cycloid that connects the points (0,0) and (5, 1). 



This feature of the minimal slide is surprising. Can we be sure that this slide 
truly minimizes the time? Would not a straight slide do a better job? To convince 
ourselves that we do have the minimal slide, let us compare the times required for 
the particle to go from (0, 0) to (5, 1) when it uses either the cycloid or a straight 
slide. We shall calculate \/2gt[x] for each case and compare the answers. 

For the cycloid we have 

f Zl y/l + x' 2 

2<?t C ycloid — / 7= dz. 

JO \ z 

With the change of variables introduced above we have x' = (dx / dff) / (dz j d9) = 
(1 — cos 9) j sin 9, so that 



- X 



yW^ + q-cos^ _ y2(r 



cosf 



sin 9 sin 9 

It follows that 

f ? y2(l-cosfl) asmOdO p— f Bl 

^cycloid - / —„ , = = V la \ (W, 

Jo sw.6 Va[l - cos 9) J 

or 

y^cycioid = \ 2a6x ~ 6.1466, 

using the numerical values listed previously. 

The shape of the straight slide is described by x — 5z, which implies that x' = 5. 
In this case we have 



f 1 \/26 

y^straight = / dz = 2\/26z 1/2 

Jo V z 



or 



2<?i s trai g ht = 2V26 ~ 10.198 



2.2 Calculus of variations 
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and this is a larger number. 

We have found that, sure enough, t cy cioid < Straight- The particle spends less 
time on the cycloid than on the straight slide, in spite of the fact that loses speed 
on the way up toward (x\, z\). The reason is that it picks up a lot of speed on the 
way down, and this more than makes up for the loss of speed on the way up. The 
straight slide just does not measure up. 



2.2.6 Multiple paths 



It is useful, and necessary, to generalize the calculus of variations to functionals 
A that depend not on one path only, but on a collection of paths. In this final 
subsection we consider the task of extremizing the multi-path functional 



G{y u y' l \y2 1 y' 2 \---)ds 



1,2, 



(2.2.17) 



is used to label 



with respect to each individual path y a (s); the index a 
each path within the collection. 

This generalization is straightforward. For each variable y a {s) within the col- 
lection we select a reference path y a (s) and we calculate A[y~i, y~2, • • •]■ We then 
displace each path from y a (s) to y a (s) + Sy a (s) and calculate the new value A[y\ + 
Syi,y2 + 5y 2 , ■ ■ •] for the functional. The extremum of A is found by demanding 
that the variation 5 A = A[y~i + Syi,y 2 + Sy 2 , ■ ■ ■]— A[yi,y 2 , ■ ■ •] vanish to first order 
in the displacements Sy a (s). As before we impose that the reference and displaced 
paths all begin and end at the same end points, y a (so) and y a {si). We therefore 
impose that the variations Sy a all vanish at the end points, Sy a {so) = = 5y a (si). 

The change in functional that occurs when we displace the paths from the ref- 
erence paths y a (s) is 



SA = 



G(yi + Syi,y[ + Sy' 1 ;y 2 + 5y 2 ,y 2 + Sy 2 ; ■ ■ •) - G(y 1 ,y 1 ';y 2 ,y' 2 ; ■ ■ •) 



ds. 



By Taylor's theorem, 

G(yi +Sy 1 ,y[ +Sy' 1 ;y 2 +Sy 2 ,y 2 + Sy' 2 ; ■ • •) = G(yi,m';m,y2, ' ' 



+ 



dG 
dyi 
dG 
dyi 



V-L=V\,V\=V\\Vi =V2 ,v 2 =v 2 ; 



=U2-y 2 =v 2 : 



Syi + 



Sy2 



dG 

dG 
dg 



vi=yi ,y 1 =v 1 -y2=v2 ,v 2 =y 2 \ 



Sy' 2 



Here G is differentiated with respect to each one of its variables, and the partial 
derivatives are evaluated on the reference paths; we discard all terms that are not 
linear in the displacements 5y a and Sy' a . We have, in a more compact notation, 



5A 



E 



dG 
dy a 



Sy a 



dG 

Wa 



where we sum over all the variables and omit the warning that all partial derivatives 
must be evaluated on the reference paths, at y a — y a and y' a = y' a . 
We now write 

S Va = y'a - V'a = J- s (Va - Va) = ^JVa 



ds 



and express the second term within the integral as 

Wa d{SVa)= iWa SVa )- Sya iWa 
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Integrating this term by parts gives 
dG 



dG d dG 



dy a ds dy' a 



Because the displacements must vanish at the end points, the two boundary terms 
disappear. And because each displacement Sy a (s) is independent of any other dis- 
placement, and the displacements are arbitrary in the interval so < s < si, we 
conclude that 

SA = => *™-™ =0 . (2.2.18) 
ds dy' a dy a 

We have one EL equation for each path y a {s). This simple statement provides the 
desired generalization of the calculus of variations to multi-path functionals. 



2.3 Hamilton's principle of least action 

In Chapter 1 we saw that Newton's law, F = ma, can serve as the very foundation 
of all of mechanics; conservation of momentum, angular momentum, and energy 
could be derived as a consequence of this dynamical law. In this section we offi- 
cially replace this old foundation by a new one, which is at once more practical, 
more powerful, and more easily gcneralizablc to other areas of physics. This new 
foundation will be Hamilton's principle of least action; the dynamical law F — ma, 
and the statements of conservation, will all be derived as consequences of this new 
principle. 

The principle of least action states that of all the paths q a (t) that a system of 
particles could take to go from an initial configuration q a (to) to a final configuration 
q a (ti), the paths q a (t) that the particles actually take are the ones that minimize 
the action functional ^ 

S[q a ] = / L(q a ,q a )dt, (2.3.1) 
J t 

where 

L = T —V (2.3.2) 

is the Lagrangian function of the mechanical system. The Lagrangian is the differ- 
ence between T, the system's total kinetic energy, and V, the total potential energy. 
The Lagrangian can be expressed in any system of generalized coordinates q a that 
conveniently describe the system's degrees of freedom. Because the Lagrangian is 
a scalar function (as opposed to a vectorial function), the choice of coordinates is 
immaterial to the formulation of Hamilton's principle. In particular, it is not nec- 
essary to adopt Cartesian coordinates attached to an inertial frame. (Of course, 
nothing prevents us from making this choice if it is convenient.) 

To find the paths q a (t) that minimize the action functional we follow the tech- 
niques developed in Sec. 2.2. Here S[q a ] is a multi-path functional, and the paths 
q a {t) play the role of the functions y a (s); the Lagrangian plays the role of the func- 
tion G, and the parameter is the time t. There is no need to repeat the calculations 
described in Sec. 2.2.6; the conclusion is 

SS = => "^-^=0. (2-3.3) 
dt dq a dq a 

These are the Euler-Lagrange (EL) equations for the mechanical system; there is 
one EL equation for each degree of freedom. The EL equations, when fully worked 
out, become a set of second-order differential equations for the paths q a {t). The 
solutions to these equations, which much be subjected to the boundary conditions 
at t = to and t = t\, are the paths q a (t) that minimize the action functional. 



2.4 Applications of Lagrangian mechanics 
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We have already seen in Sec. 2.1 that when the generalized coordinates q a {t) of 
a particle are Cartesian, so that the Lagrangian takes the form L — \m{x 2 + y 2 + 
z 2 ) — V(x, y, z), then the EL equations become the vectorial equation ma + 'VV = 0. 
Recalling that the force acting on the particle is F = — VV", this is obviously 
F = ma, and we have derived Newton's fundamental law from a deeper principle, 
Hamilton's principle of least action. The beauty of this Lagrangian formulation of 
mechanics, however, is not so much that Newton's equation follows from a deeper 
principle. Its beauty is much more in the fact that Hamilton's principle frees us 
from the need to always set up the equations in terms of Cartesian coordinates. Any 
system of generalized coordinates q a (t) will do; they all lead to the EL equations of 
Eq. (2.3.3), and the choice is entirely one of convenience. 

In the following sections we will explore the power of Hamilton's principle in a 
number of applications. We will take full advantage of the generalized nature of 
the coordinates q a (t), and the EL equations will allow us to derive the equations of 
motion very efficiently, with far less effort than would be required in the traditional 
Newtonian formulation. 

2.4 Applications of Lagrangian mechanics 

2.4-1 Equations of motion in cylindrical coordinates 

As was just stated, the principal advantage of the Lagrangian formulation of me- 
chanics is that it is based on a scalar function L which can be expressed in any 
coordinate system whatever. We shall begin our discussion with a derivation of the 
equations of motion in cylindrical coordinates; the case of spherical coordinates will 
considered next. 

Suppose that a particle moves in the presence of a potential V that is most 
simply expressed in terms of cylindrical coordinates (p, <p, z). These are related to 
the usual Cartesian coordinates (x,y,z) by 

x = pcos(j), y — pshitfi, z — z. (2-4.1) 

To use cylindrical coordinates would be advantageous, for example, when the po- 
tential is axially symmetric, so that it depends only on p and z, or cylindrically 
symmetric, when it depends only on p. 

From Eq. (2.4.1) we obtain the total differentials 

dx — (cos (j)) dp — (p sin 0) d<j>, 
dy = (sin <f>) dp + (p cos (p) d<f>, 
dz = dz. 

It follows that the squared distance between two neighbouring points is given by 
ds 2 = dx 2 + dy 2 + dz 2 , or 

ds 2 = dp 2 + p 2 dc/} 2 + dz 2 . (2.4.2) 

The squared velocity is then 

2 /ds\ 2 . 2 212 i -2 
V =(-] =p +p<b +z, 

and the particle's kinetic energy is T — \m{p 2 + p 2 <j) 2 + z 2 ). The Lagrangian is 
therefore 

L(p, p; <f>, 0; z, i) = l -m{p 2 + p 2 ^ 2 + i 2 ) - V(p, 0, z). (2.4.3) 
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Exercise 2.5. Verify Eq. (2.4.2). 



The equations of motion for the particle are obtained by substituting L into the 
EL equations for q a = (p,(f>,z). We begin with the equation for p. We have, from 
Eq. (2.4.3), 

dL 



This implies 



We also have 



-wr = rap. 
dp 



d dL 



dL ■, dV 

— = mpcj) - — , 
dp dp 



and the EL equation gives 



dV 

mp - mpcj) 2 + — = 0. (2.4.4) 
dp 

We continue with the equation for <j>. We now have 

—r = mp (f>, 

d(p 

which implies 

d dL d ( 2 - \ 
— = m— [ p 6). 

dt dj> dt V Y J 

Notice that we choose to leave the total time derivative unevaluated; to evaluate it 
would require some care, because both <j> and p 2 depend on time in this expression. 
We also have 

dL _ dV 

and the EL equation gives 

d ( 2 a dV 
'dA P V + dj 



■ t" 2 ''') ' do (2A5) 



We conclude with the equation for z. Here the computations are quite easy. We 
have 

dL 

dI =mZ > 

so that 

d_dL_ _ .. 
dt dz 

and we also have 

dL _ _dV 
dz dz ' 



The EL equation for z is therefore 



dV 

mil - =0. (2.4.6) 

dz 
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The equations of motion (2.4.4)-(2.4.6) could also be derived by resolving New- 
ton's equation F = ma in the vectorial basis (p,(j>,z). The results would be 
identical, but the computations would be much more laborious. 



Exercise 2.6. Challenge yourself: Derive Eqs. (2.4.4)-(2.4.6) the hard way, as described 
in the previous paragraph. Begin by computing the acceleration vector a in terms of the 
cylindrical coordinates (p, <j>, z). Next, find the basis vectors p, <j>, and z using the method 
outlined in Sec. 1.2. Finally, resolve the equation ma + W = in this basis, and use the 
chain rule to calculate dV/dp and dV/d<j> in terms of dV/dx and dV/dy. The end result 
should resemble Eqs. (2.4.4)-(2.4.6). If you are not already, after all this you will be fully 
convinced of the superiority of the Lagrangian methods! 



2.4-2 Equations of motion in spherical coordinates 

Suppose now that a particle moves in the presence of a potential V that is most 
simply expressed in terms of spherical coordinates (r, 9, <f>). Their relation with the 
usual Cartesian coordinates (x, y, z) is 

x = r sin 9 cos <f), y — r sin 9 sin <fi, z — r cos 9. (2.4.7) 

The use of spherical coordinates would be advantageous, for example, when the 
potential is axially symmetric, so that it depends only on r and 9, or spherically 
symmetric, when it depends only on r. 

From Eq. (2.4.7) wc obtain the total differentials 

dx = (sin 9 cos </>) dr + (r cos 9 cos <j>) d9 — (r sin 9 sin</>) d<f>, 
dy = (sin 9 sin <f>) dr + (r cos 9 sin <f)) d9 + (r sin 9 cos <f>) d<j>, 
dz = (cos 9) dr — (r sin 9) d9. 

It follows that the squared distance between two neighbouring points is given by 

ds 2 = dr 2 + r 2 d9 2 + r 2 sin 2 9 dej) 2 . (2.4.8) 

The squared velocity is then v 2 = r 2 + r 2 9 2 + r 2 sin 2 9 4> 2 , and the Lagrangian is 

L(r, r; 9, 9; <f>, <f>) = ^m{r 2 + r 2 9 2 + r 2 sin 2 9 <j> 2 ) - V(r, 9, 0). (2.4.9) 



Exercise 2.7. Verify Eq. (2.4.8). 



The equations of motion for the particle are obtained by substituting L into the 
EL equations for q a = (r, 9, <fi). We have, from Eq. (2.4.9), 

dL 

— = mr, 
or 

so that 

d dL 

dt^r^™- 

We also have 

— = mr(9 2 + sm 2 9 <j) 2 - — , 
or Or 



and the EL equation for r is 



dV 

mr - mr(9 2 + sin 2 9cf> 2 ) + — = 0. (2.4.10) 
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Moving on, we now have 

— - = mr 9, 
88 

which implies 

d dL d ( 2 -\ 

r = m—\r l 9\. 

dt d6 dt\ ) 

We also have 

— = mr sm9cos9(f> - — , 

and the EL equation gives 

d ( oaN o . „ „ ;9 



Finally, we have 



which implies 



We also have 



m—(r 2 9) - mr 2 sin 6 cos 6 4> 2 + — = 0. (2.4.11) 
dt\ J o0 

dL 



mr 2 sin 2 9 <f>, 



d dL d ( 2 . 2 • \ 
= "i— ^ sin 



dL _ _dV 
~d4> ~ ~~d~0' 



and the EL equation gives 



,. ( / ;! s:i: J «, : ;) +-^r=0. (2.4.12) 



The equations of motion (2.4.10)-(2.4.12) could also be derived by resolving 
Newton's equation F = ma in the vectorial basis (f,0, </>). The results would be 
identical, but as in the preceding subsection the computations would be much more 
laborious. 



Exercise 2.8. Challenge yourself once again: Derive Eqs. (2.4.10)-(2.4.12) the hard 
way, as described in the previous paragraph. Or finally cry uncle and pledge allegiance to 
the Lagrangian way of life! 



2.4-3 Motion on the surface of a cone 

As our first real application of the Lagrangian formalism, we consider a particle 
that is constrained to move on the surface of a cone, subjected to gravity. As shown 
in Fig. 2.7, the cone has an opening angle of 2a, and it is placed vertically in the 
gravitational field. The particle is at a distance r(t) from the cone's apex, and at 
an angle <fr(t) relative to the x axis. Because the particle is confined to the cone's 
surface, its angle 9 with respect to the z axis is a constant; it is in fact equal to a. 

The motion of the particle is best described in terms of spherical coordinates 
(r,9,<f>), with 9 restricted at all times to the value a. According to the results of 
Sec. 2.4.2, its kinetic energy is T = \m{f 2 +r 2 sin 2 acf) 2 ), and its potential energy 
is V = mgz = mgr cos a. The Lagrangian is therefore 

L(r, r; (f), 4>) = -m(r 2 + r 2 sin 2 a <j) 2 ) — mgr cos a. (2.4.13) 

The equations of motion for r{t) and <p(t) are obtained by substituting this La- 
grangian into the EL equations. 
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We have 

dL 

ttt = mr, 
or 

so that 

d dL _ 
dt df 

We also have 

° L ■ 2 12 

— — = mr sm a <p — mg cos a, 
or 

and the EL equation for r is 

f - rsin 2 a0 2 +gcosa = 0. (2.4.14) 

Moving on, we observe that L is independent of <j>, and the fact that dL/d(f> = 
means that the EL equation for <f> reduces to 

This implies that the quantity dL/d(j) is a constant, which we shall call mh. Calcu- 
lating the partial derivative gives dL/dcj) = mr 2 sin acj), and we finally obtain the 
statement 

r 2 sin 2 acj) — h — constant. (2.4.15) 

The quantity h is readily interpreted as the z component of the particle's reduced 
angular momentum vector, and it is a constant of the motion. Equation (2.4.15) 
shows that (f> is always of the same sign; the angular part of the motion is monotonic. 
Substituting <j) — h/(r 2 sin 2 a) into Eq. (2.4.14) produces 

h 2 

r „ h g cos a = 0. 

r 3 sin a 

This equation can be integrated by using the standard trick of multiplying each 
term by r (recall that we used this trick back in Sec. 1.5.6). We have 

h 2 f 

rr „ h gr cos a = 0, 

r 3 sin a 
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Figure 2.8: Energy diagram for a particle moving on a cone. The motion always takes 
place between two turning points at r = r±. 



or 



d (\ 



-7—2 h gr cos a 



dt\2 2r 2 sin 2 a 
This finally gives us the conservation statement 

1 -2 

—r+u(r)=e = constant, 

where e is the particle's reduced total mechanical energy, and 

h 2 



v{r) 



2r 2 sin a 



gr cos a 



(2.4.16) 



(2.4.17) 



is an effective potential for the radial part of the motion. Equations (2.4.16) and 
(2.4.17) give rise to the energy diagram of Fig. 2.8. From this diagram we immedi- 
ately conclude that the motion takes place between two turning points at r = r±; 
these are determined by the condition v(r±) = e. 

To obtain a full picture of the motion Eqs. (2.4.14) and (2.4.15) must be inte- 
grated numerically. Results of such a numerical integration are presented in Fig. 2.9. 
To carry out these integrations the equations are recast into the following set of 
first-order equations: 



h 



g cos a, 



9-2 ' 

r z sm a 



where we have introduced the auxiliary variable v. We start the integration at 
r = r_, setting v — (as we must) and <fi = 0. The constant h can be determined 
in terms of r_ and r + by using the relation v(r-) = v{r + ), which follows from 
Eq. (2.4.16). The result is 



h 2 = 2g sin a cos a 
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-3 1 ' 1 1 1 1 1 

-3-2-1 1 2 3 

x = r sin(alpha) cos(phi) 



Figure 2.9: Numerical integration of the equations of motion for a particle moving on 
the surface of a cone. To produce these results we have chosen a = 0.8, r_ = 1, and 
r+ = 4. The upper panel shows the raw results for r(t), r(t), and 4>{t). Notice that the 
radial velocity is zero whenever r — r± and that it oscillates between positive and negative 
values. Notice also that <j> is always positive; it is maximum whenever r — r_ and minimum 
whenever r — r+. The lower panel shows the projection of the particle's motion in the 
x-y plane. To obtain this we let x{t) = r(t) sin a cos <j>(t) and y(t) — r(t) sin a sin (j>(t). The 
motion proceeds counterclockwise and the figure is that of a regressing ellipse. 
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Figure 2.10: The motion of a spherical pendulum is described in terms of the angles 
d(t) and 4>{t). 



Exercise 2.9. Verify the quoted relation between h 2 and r±. 



2-4-4 Spherical pendulum 

We now examine the situation of a pendulum which is free to move in all directions 
about its pivot point. The pendulum has a mass m, a constant length £, and its 
motion is described in terms of the two angles 9(t) and <p(t), as shown in Fig. 2.10. 
These coordinates are related to the standard Cartesian coordinates by 



£ sin 9 cos < 



1J 



£ sin 9 sin < 



I cos 9. 



As shown in the figure, the z axis is pointing down, in the direction of the grav- 
itational acceleration g. It is clear that we are once more dealing with spherical 
coordinates. This time, however, it is the radial coordinate r that is held fixed to 
the value £. According to the results of Sec. 2.4.2 the pendulum's kinetic energy 
is T = ^m£ 2 (9 + sin 2 9<j) 2 ). Its potential energy is V — —mgz = —mg£ cos 9 = 
—m£ 2 uj 2 cosf?, where we have re-introduced the quantity 



The pendulum's Lagrangian is 



(2.4.18) 



1 



L{9,9-(f),(t>) = ^m£ 2 (9 2 + sin 2 9 4> 2 ) +jti£ 2 lu 2 cos I 



(2.4.19) 



The equations of motion for 9(t) and <p(t) are obtained by substituting this La- 
grangian into the EL equations. 
We compute 

S - m£ 2 9\ 
09 
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which implies 

r = mte. 

dt 89 

We also have 

—— = m^ 2 sin 6 cos 9 <j? — m£ 2 u> 2 sin 0, 

and the EL equation for 9 is 

9 -sin9cos9(j) 2 +LU 2 sin9 = 0. (2.4.20) 

Moving on, we observe that L is independent of cj>, and the fact that dL/dcj) = 
means that the EL equation for <j> reduces to 

This implies that the quantity dL/dcj) is a constant, which we shall call m£ 2 h. 
Calculating the partial derivative gives dL/dcj) = mi 2 sin 9 cf>, and we finally obtain 
the statement 

sin 2 9 cj> = h = constant. (2.4.21) 

The quantity h is once more interpreted as the z component of the pendulum's 
reduced angular momentum vector, and it is a constant of the motion. In the 
special case h = the pendulum is prevented to move in the cj> direction, and 
Eq. (2.4.20) for 9 reduces to 9 + uj 2 sin 6* = 0; this is the same equation that was first 
derived in Sec. 1.3.7, and then again in Sec. 2.1, and which describes the motion of 
a planar pendulum. In the general case (h ^ 0) we see that <j> is always of the same 
sign, so that cj>(t) is a monotonic function of time; this means that the pendulum 
rotates in a consistent direction around the z axis. 

With the substitution <j> = /i/sin 2 9 Eq. (2.4.20) becomes 

•• h 2 cos 9 , . n 

9 5 V w 2 sin6» = 0. 

shr 9 

Multiplying each term by 9 allows us to integrate this equation. The result is the 
conservation statement 

-9 2 + v{0) = e = constant, (2.4.22) 
where e is the pendulum's reduced total mechanical energy, and 

h 2 

v{6) = ^ lo 2 cos9 (2.4.23) 

2 sin 9 

is an effective potential for the motion in the 9 direction. Equations (2.4.22) and 
(2.4.23) give rise to the energy diagram of Fig. 2.11. From this diagram we may 
immediately conclude that the motion takes place between two turning points at 
9 = 9±; these are determined by the condition v{6±) = e. 

Exercise 2.10. Verify that Eqs. (2.4.22) and (2.4.23) do indeed follow from the 
equations of motion. 

In Fig. 2.12 we present the results of a numerical integration of the equations of 
motion, which we recast into the first-order form 

• h 2 cos9 2 . ■ h 

9 = v, v= 5 w smv, cp = — . 

sin 3 6» sin 2 9 
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e 

Figure 2.11: Energy diagram for the spherical pendulum. The motion always takes 
place between two turning points at 9 — 9±. 

We start the integration at 9 = 0_, setting v = and = 0. The constant h can 
be determined in terms of and 9 + by using the relation v(6J) = v{9 + ) 1 which 
follows from Eq. (2.4.22). The result, after some algebra, is 

i2 2 2 ( Sm ^+ Sm ^-) 2 ( C0S ^- _ C0S ^+) 

(sin 9 + — sin ) (sin 9 + + sin 9- ) ' 



Exercise 2.11. Verify the quoted relation between h 2 and 8±. 



2.4-5 Rotating pendulum 

Another variation on the pendulum theme has the pivot point of a planar pendulum 
forced to rotate with a constant angular velocity Q on a circle of radius a. This 
situation is shown in Fig. 2.13. Once more we describe the motion of the pendulum 
in terms of the swing angle 9(t), which is defined relative to the vertical direction; 
this we now associate with the y-direction. 

The Cartesian coordinates of the pendulum, relative to the pivot point, are 

^relative = i Sin 0, ^relative = ~£ COS 9. 

The Cartesian coordinates of the pivot point are 

^pivot = acosfii, 2/pivot = a sin fit. 

The Cartesian coordinates of the pendulum, relative to the origin of the coordinate 
system, are therefore 



x = a cos fit + £ sin 9, y = a sin fit — £ cos 9. 



(2.4.24) 
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Figure 2.12: Numerical integration of the equations of motion for a spherical pendulum. 
To produce these results we have chosen (9_ — 0.2, 6+ = 2.5, and set u) — 2n. The 
upper panel shows the raw results for 6{t), 0(t), and (j>(t). Notice that 6 is zero whenever 
6 = 9± and that it oscillates between positive and negative values. Notice also that 
4> is always positive; it reaches a local maximum whenever 9 — 9±. The lower panel 
shows the projection of the pendulum's motion in the x-y plane. To obtain this we let 
x(t) = sin 9(t) cos cj>(t) and y(t) = sin 9(t) sin <j>(t). The motion proceeds counterclockwise. 
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Figure 2.13: The motion of a rotating pendulum is described in terms of the swing 
angle 6(t). The pivot point rotates with a constant angular velocity Q on a circle of radius 



The components of the velocity vector are 

x = —afl sin fit + £9 cos 6, y = afl cos fit + £9 sin 9. 

The squared velocity is then calculated as 

v 2 = x 2 + y 2 = (ail) 2 + 2a£fl9 sin(0 - fit) + £ 2 9 2 , 

and the kinetic energy of the pendulum is T = \mv 2 . Its potential energy is 
V = mgy = mg{a sin fit — £ cos 9) . 

Exercise 2.12. Verify the preceding result for v 2 . 



The Lagrangian of the rotating pendulum is, finally, 

L(0,0;t) = im (an) 2 +2am9sm(9-nt)+£ 2 9 2 - rnlu 2 (a sin fit- i cos 0), (2.4.25) 

where we have once more introduced lo 2 = g/t, A new feature of this Lagrangian is 
that it depends explicitly on time; this comes about because the pendulum is not left 
alone to its own devices, but is instead acted upon and forced to follow a rotational 
motion. In this circumstance we cannot expect the energy of the pendulum to be 
conserved: There will be at all times a transfer of energy between the pendulum and 
the external agent that is responsible for the rotational motion. Globally the total 
energy is conserved, but the energy of the pendulum is not individually conserved. 
To obtain the equation for motion we must first calculate 



dL 
9^ 



ma£flsm(9 - fit) + m£ 2 9. 



Differentiating this with respect to time gives 



d_ dL 
dt 89 



ma£flcos(9 - fit) (9 -fl) + m£ 2 
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We next compute 



dL 

— = ma£Sl6 cos(6 — Qt) — ml 2 J 2 sin 6, 
o9 



and substituting all this into the EL equation produces 

9 + lj 2 sin 6 - (a/£)fl 2 cos(8 - Qi) = 0. (2.4.26) 

This is the equation of motion of a rotating pendulum. 

Equation (2.4.26) cannot be integrated with the help of the 9 trick; this is 
prevented by the fact that the equation depends explicitly on time through the 
term in cos(# — fit). As a consequence, the motion cannot be analyzed with the 
help of an energy diagram; this can be understood from the very fact that the total 
mechanical energy of the rotating pendulum is not conserved. The only tool that 
remains at our disposal to analyze the motion is numerical integration, and Fig. 2.14 
displays the results. 

The graphs reveal that when the pendulum is driven at a frequency f2 that is 
close to its natural frequency u>, the response is more violent: the amplitude of 
the oscillations is then much larger. This is the phenomenon of resonance. This 
phenomenon can be illustrated in the context of a simpler model, one which can 
be solved exactly. We consider a simple harmonic oscillator which is driven by an 
oscillating external force. The equations of motion for this simplified model is 

9 + lu 2 9 = AcosQt. (2.4.27) 

When ft ^ lo a solution to this equation is 

A 

Q(t) = — —cosVLt (Vt^to). (2.4.28) 

lo 1 — \ V 

In this situation the pendulum oscillates at the driving frequency fi, and the oscil- 
lations have a constant amplitude. Notice, however, that the amplitude grows as ft 
approaches the natural frequency lo. The solution of Eq. (2.4.28) is not valid when 
O = lo. In this case we have instead 

At 

9(t) = — sintot (n = cu). (2.4.29) 

ZLO 

In this case the oscillations keep growing in amplitude; the simple harmonic oscil- 
lator is in resonance with the driving force. 



Exercise 2.13. Verify that Eqs. (2.4.28) and (2.4.29) are solutions to Eq. (2.4.27). A 
more challenging question: What is the general solution to Eq. (2.4.27) when Q, ^ ui and 
when Q. — wl The general solution should be parameterized in terms of the initial angle 
8(0) and the initial angular velocity 0(0). What choices of initial conditions give rise to 
Eqs. (2.4.28) and (2.4.29)? 



When the rotating pendulum is driven at resonance we observe a growth in the 
amplitude of oscillations, but this growth is bounded; it saturates and the amplitude 
then starts to decrease. This saturation is produced by nonlinear effects: When 
the amplitude grows the natural period of the oscillations changes (as we learned 
back in Sec. 1.3.7) and the pendulum is no longer driven at its natural frequency. 
As resonance stops the amplitude starts to decrease and the pendulum's natural 
frequency returns to its original value. At this stage the conditions are once more 
suitable for a resonant growth of the amplitude, and the cycle repeats. 

For certain choices of parameters the driving force can have a dramatic influence 
on the pendulum. This is illustrated in Fig. 2.15, for which the driving frequency 
was set to O = 0.9lo. Here we see the driving force causing the pendulum to 
go beyond 9 — ir, completing one or two revolutions before returning to a short 
oscillation cycle. 
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Figure 2.14: The motion of a rotating pendulum. Each graph shows the swing angle 
6{t) of the driven pendulum in blue, and the swing angle of a free pendulum in red. In 
the first graph the pendulum is driven at a low frequency set at Sl/ui = 0.4. In the second 
graph the pendulum is driven at a high frequency set at Q/u> = 2.4. In the third graph the 
pendulum is driven at resonant frequency, so that il/ui = 1.0; notice the large amplitude 
of oscillations in this case. In all cases we have set (a/£)ft 2 = 0.2, and the initial conditions 
are 0(0) = 0.2 and 8(0) = 0.3. 
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Figure 2.15: The motion of a rotating pendulum, with 0,/uj = 0.9, (a/£)D. 2 = 0.2, 
0(0) = 0.2, and 0(0) = 0.3. 



2.4-6 Rolling disk 

As our next application we consider a disk of mass m and radius R that rolls without 
slipping on an inclined plane of total length £• the plane's inclination relative to the 
horizontal is a. As shown in Fig. 2.16, the distance from the top position on the 
plane to the disk's centre of mass — its geometric centre — is denoted s, and 9 is 
the angle of a selected point on the disk's rim relative to an axis perpendicular to 
the inclined plane. 

There is both a translational motion of the centre of mass and a rotational 
motion of the disk in this problem. The disk's kinetic energy is 

T = \ms 2 + \l9\ 

where I = ^mR 2 is the disk's moment of inertia. The coordinates s and 9, however, 
are not independent; they are related by the no-slip condition, which implies s = R9. 
So we have s = R9 and the kinetic energy becomes 

T = 1 mR 2 9 2 + -mR 2 9 2 = 3 mR 2 9 2 . 
2 4 4 

The disk's potential energy is V — mgz = mg{l — s) sin a = mg(£ — R9) sin a. 
The Lagrangian is therefore 

L(6, 9) = ^mR 2 9 2 - mg{£ - R9) sin a. (2.4.30) 

To obtain the disk's equation of motion we substitute this into the EL equation. 
We first compute 

— - = -mR z 9, 
39 2 

which implies 

d dL 3 o2 " 

r = -mR i 9. 

dt 09 2 
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(l-s)sina 



Oil 



Figure 2.16: A disk rolling without slipping on an inclined plane. The plane has a 
length I and its inclination angle is a. The distance from the disk's centre to the top of 
the plane is s; the height of the disk's centre is (I — s] sin a. 



We also compute 



The equation of motion is then 



dL 
80 



mgR sin a. 



- _ 2gsina 
3R ' 

and we find that the disk is under constant angular acceleration. 

If we assume that the disk started with zero angular velocity, then Eq. 
integrates to 

_ gsma 2 
d{t) "liT* ■ 

The time tbottom required for the disk to reach the bottom of the inclined 
determined by the condition 6>(£bottom) = tjR- Solving this gives 



/ 31 

^-bottom A / ■ 

U gsma 

Notice that ibottom is independent of R, the disk's radius. 



(2.4.31) 

(2.4.31) 

(2.4.32) 
plane is 

(2.4.33) 



2.4-7 Kepler's problem revisited 

As a final application of the Lagrangian formalism we will rederive the main equa- 
tions of Kepler's problem. As we shall see, the Lagrangian methods give a much 
more efficient way of obtaining these equations. 

As in Sec. 1.5.2 we express the position vectors r\ and r 2 of the two massive 
bodies in terms of the relative separation vector r = r\ — r 2 and the position R of the 
centre of mass, which is determined by MR = m\T\ + m2r 2 , where M = mi + rn 2 
is the total mass. We have r\ = R+ (m 2 /Af)r, r 2 = R— (mi/M)r, and after some 
algebra we find that the system's kinetic energy is 



T = 



1 



-m\T\ ■ T\ 



1 



-m 2 r 2 • r 2 



1 • 1 mi m 2 . 

= 2 MR - R+ 2^r*-*- 
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The system's potential energy, on the other hand, was calculated in Sec. 1.5.1, and 
it is given by 

_ GTO1TO2 
r ' 

where r = \r\ = \n — r 2 | is the distance between the two bodies. The system's 
Lagrangian is 

L(R, R; r, r) = ^MR ■ R + • r + (2.4.34) 

where 

M=^p, M = mi +m 2 (2.4.35) 
is known as the reduced mass of the two-body system. 

Exercise 2.14. Go through the algebra that leads to our previous expression for the 
kinetic energy in terms of R and f. 

Notice that the Lagrangian of Eq. (2.4.34) separates into two independent pieces. 
The first piece depends on R only, and is independent of r; this is the Lagrangian 
of the centre of mass, 

L CM {R,R) = ^MR-R. 

The second piece depends on r only, and is independent of R; this is the Lagrangian 
of the relative separation between the two bodies, 

r , .x 1 • • GflM 

L ie i(r,r) = -fir ■ r H . 

2 r 

Notice now that Lcm contains only a kinetic-energy term. The absence of a 
potential-energy term implies that the motion of the centre of mass is free. As 
a quick calculation will verify, the EL equations for R take the form R = 0, for 
which the solution is R(t) = R(0) + R(0)t. As we have seen in Sec. 1.5.2, the centre 
of mass moves freely, and it can be placed at the origin of an inertial frame. The 
relative Lagrangian, on the other hand, contains both a kinetic-energy term and a 
potential-energy term. It describes the motion of a fictitious particle of mass \x in 
the gravitational field of a central mass M, also fictitious. As we have witnessed 
before in Sec. 1.5.2, our original two-body problem has simplified into an effective 
one-body problem. 

To proceed we may switch from the Cartesian coordinates r = (x, y, z) to any 
system of generalized coordinates q a . Recalling from Sec. 1.5.3 that the motion takes 
place in the x-y plane (a fact that could be re-derived on the basis of Lagrangian 
mechanics), we adopt the polar coordinates (r, 4>), related to x and y by x = rcos(j) 
and y = r sin (f>. We have x = r cos <fi — rip sin <j>, y — r sin <f> + r<j> cos <f>, and it follows 
that 

f ■ f = x 2 + y 2 = f 2 + (r<j)) 2 . 



The Lagrangian therefore becomes 



1 



G[iM 



L rel (r,r;0,<A) = -fj, f + (r^) 2 + (2.4.36) 

A L J T 

Notice that this Lagrangian is actually independent of </>, a feature that was en- 
countered also in previous examples. 

To obtain the equation of motion for r we compute 

dL IC \ . d dL re \ 
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and 

-&r = ^ - —■ 

This gives 

r - rtf + ^ = 0. (2.4.37) 



This is the same statement as Eq. (1.5.25). To obtain the equation of motion for 
<j) we observe that since L rc \ is independent of 4>, we must have that dL le i/d<j> is 
a constant of the motion. Calling this constant fih and calculating the partial 
derivative, we get 

r 2 4> = h = constant. (2.4.38) 

This is the same statement as Eq. (1.5.27), and h is identified as the reduced angular 
momentum of the two-body system. 

The equations of motion (2.4.37) and (2.4.38) can be analyzed with the same 
mathematical techniques as those employed in Sec. 1.5. It should be clear that 
compared with the Newtonian methods of Chapter 1, the Lagrangian methods 
provide a much simpler way of obtaining these equations. 



2.5 Generalized momenta and conservation 

statements 

2.5.1 Conservation of generalized momentum 

In the applications of Lagrangian mechanics presented in Sec. 2.4 it occurred a 
number of times that the Lagrangian was independent of one of the generalized 
coordinates (mostly it was the 4> coordinate), and we saw that this fact always 
translated into the existence of a constant of the motion (which we usually called 
h). A specific example is the case of a particle moving on the surface of a cone 
(Sec. 2.4.3), for which the Lagrangian is indeed independent of <j) and for which the 
constant of the motion was h — r 2 sin 2 a <j). A similar situation occurred for the 
spherical pendulum (Sec. 2.4.4) and for Kepler's problem (Sec. 2.4.7). 

It is easy to generalize this discussion and to derive the very useful fact that 
whenever the Lagrangian does not depend explicitly on one (or more) of the gen- 
eralized coordinates q a , there exists a corresponding constant of the motion. To 
establish this statement we shall first introduce the notion of a generalized momen- 
tum. 

Consider a Lagrangian L(q a ,q a ) that depends on a number of generalized coor- 
dinates q a and a number of generalized velocities q a . The quantities 

Pa = (2.5.1) 

oq a 

feature prominently in the EL equations, which can be written in the form 

Pa = (2.5.2) 
dq a 

The quantities p a are the generalized momenta of the mechanical system. There is 
one generalized momentum p a for each generalized coordinate q a . 

The generalized momenta can represent cither a component of the linear- momentum 
vector or a component of the angular-momentum vector. Generally speaking, when- 
ever q a represents a linear variable the corresponding p a will be a linear momen- 
tum; and whenever q a represents an angular variable its corresponding p a will be 
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an angular momentum. Consider, for example, the Lagrangian of a free particle in 
cylindrical coordinates (Sec. 2.4.1). This is 



L= 1 -m(p 2 +p^ + z 2 ). 



The generalized momenta are 



dL 
dp 
dL 



P P = gr=mp, 



Pz 



dL 

dz 



In the case of p p and p z we clearly have quantities that represent components 
of a linear-momentum vector. But the case of p^ is different. Here we have 
Pcf, = m(p)(p<j>), and this clearly represents the component of an angular-momentum 
vector. 



Exercise 2.15. Show that p p — p ■ p and p z = p ■ z, where p is the particle's 
momentum vector. Show, on the other hand, that = L ■ z, where L is the particle's 
angular-momentum vector. 



Suppose now that a Lagrangian L(q\, qi; qi, • • ■) happens not to depend ex- 
plicitly on one of its generalized coordinates, say <?*. Then 

dq* 

and it follows from the EL equation for g* that 

at 

where p* = dL/dq* is the generalized momentum associated with the coordinate q*. 
This equation states that p* is a constant of the motion, and we have established 
the following theorem: 

Whenever the Lagrangian of a mechanical system does not depend ex- 
plicitly on a generalized coordinate q* , the corresponding generalized 
momentum = dL/dq* is a constant of the motion. 

A coordinate g* that does not appear in L is sometimes called a cyclic coordinate. 
A Lagrangian may contain any number of cyclic coordinates. 

As an example consider the following Lagrangian, again in cylindrical coordi- 
nates, 

L= l -m{p 2 +p 2 4> 2 + z 2 )-V{p). 

Here it is assumed that the potential energy V depends only on p; the mechanical 
system is cylindrically symmetric. This implies that <fi and z are cyclic coordinates, 
and that p^ = mp 2 <p and p z = mz are constants of the motion. 

This theorem on cyclic coordinates and conserved quantities is extremely im- 
portant and very useful. To find all the constants of the motion is usually a key 
step during the integration of the equations of motion, and the theorem provides a 
very efficient algorithm to identify at least some of them. 
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2.5.2 Conservation of energy 

Conservation of total mechanical energy E is also an important aspect of the motion 
of a mechanical system and a key to solving the equations of motion. In this 
subsection we show that energy is conserved whenever the Lagrangian does not 
depend explicitly on time t. 

To begin the discussion let us consider a Lagrangian L(q a , q a , t) that depends on 
a number of generalized coordinates q a , a number of generalized velocities q a , and 
let us consider the possibility that it depends also explicitly on time. (An example 
is the Lagrangian of a rotating pendulum, which was written down in Sec. 2.4.5.) 
Applying the chain rule, we find that the total time derivative of the Lagrangian is 
given by 

dL m—^ dL . ^ dL .. dL 

Tt = ^dq~a qa + ^dq~a qa+ ~di' 

The first term accounts for the time dependence contained in each q a (t), the second 
term for the time dependence contained in each q a (t), and the third term accounts 
for the explicit dependence of the Lagrangian on t. 
We have defined the generalized momenta p a by 

dL 

Pa = -7T7- 

dq a 

and the EL equations can be expressed in the form 

dL 
dq a 

We make these substitutions in the previous equation, and obtain 

dL -r-^ / . . .. \ dL 

a 

or 

dL d /v-^ . \ dL 
- d -t=dt\£ Paqa ) + W 

x a ' 

which is equivalent to the previous form by virtue of the chain rule. 
We have obtained the equation 




(2.5.3) 



and a statement of conservation follows immediately: 

Whenever L does not depend explicitly on time, so that dL/dt = 0, we 
have that 

h(q a , q a ) = ^Paia - L (2.5.4) 

a 

is a constant of the motion, dh/dt = 0. 

Surely the function h(q a , q a ) must have something to do with the system's total 
mechanical energy. Let us first figure out the relationship in the context of a sim- 
ple example. We go back to the Lagrangian of a particle expressed in cylindrical 
coordinates, 

L= l -m{p 2 + p 2 4> 2 + z 2 )-V{p,cj > ,z), 
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but this time we place no constraints on the potential energy. The generalized 
momenta are p p = mp, p<p = mp 2 <j), and p z — mz. We then have 

h = P P P + P<t>4> + Pz* ~ L 

= mp 2 + mp 2 <\> 2 + mz 2 - ^m(p 2 + p 2 (j) 2 + z 2 ) + V(p, (f>, z) 

= \m{p 2 +p 2 4> 2 + z 2 ) + V{p,^,z). 

This is indeed the total mechanical energy, the sum of kinetic and potential energies. 

To verify that h(q ai q a ) is always equal to the total mechanical energy we use 
the fact that the kinetic energy is usually a quadratic function of the generalized 
velocities, 

T = ^^2A ab q a q b . 

a, b 

The coefficients A ab may in general depend on the coordinates q a , and without loss 
of generality we may assume that A ba = A ab . The Lagrangian is then 



L = \ ^A ab q a q b - V{q a ) 



2 

a.b 

The generalized momentum p a is obtained by differentiating L with respect to q a . 
To see what this amounts to let us consider a special case in which the mechanical 
system possesses three degrees of freedom. In this case we have, explicitly, 

L = \ All< il + ^129192 + -4139143 + 7,A 22 q 2 + ^23^293 + ^3393 ~ V {<lU 92, 93)- 

It follows that 

FIT, 

= A n qi + A 12 q 2 + A 13 q 3 , 
= Ai 2 qi + A 22 q 2 + A 23 q 3 , 

= ^1391 + -42392 + A 33 q 3 
are the generalized momenta. These relations are summarized by 

Pa = ^A ab q b , 

b 

and the same expression is always obtained, regardless of the number of degrees of 
freedom. The function h is then 

h = ^2Pa4a-L 

a 

= ^2(^2A ab q b jq a - ^2,A ab q a q b + V{q a ) 

\ U / „ h 





ill. 


Pi = 


dqi 




dL 


P2 = 


dq 2 




dL 


P3 


dq 3 



a.b 

= ]^^A ab q a q b + V(q a ), 

a, b 

and we conclude that 

Hq a , q a ) = T(q a , q a ) + V{q a ) = total mechanical energy. (2.5.5) 

In all generality, therefore, the function h is the system's total energy, and this is 
conserved whenever L does not depend explicitly on time. 
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2. 5. 3 Invariance of the EL equations under a change of Lagrangian 

Suppose that we are given a Lagrangian L(q a , q a , t) and that we decide to define a 
second Lagrangian L'(q a , q a , t) by adding to the first Lagrangian a term of the form 
df/dt, where f(q a ,t) is any function of the generalized coordinates q a and of time 
t. What we have then is the operation 

L^L> = L+j t f(q a ,t). (2.5.6) 

Notice that / is quite arbitrary, but that it is not allowed to depend on the gener- 
alized velocities q a . 

We assert that the equations of motion derived from L and L' will be identical. 
The Lagrangians L and L' are therefore equivalent, in the sense that they produce 
the same set of EL equations. In practice this formal property of Lagrangians can 
be useful: A complicated Lagrangian L' can be turned into a simpler Lagrangian 
L by removing a superfluous total time derivative. We will use this method of 
simplification in a later section. 



Exercise 2.16. Read through Sec. 2.4 again and figure out where a Lagrangian could 
have been simplified using this method. 



To prove our assertion we show that the change in Lagrangian, 

a j df ^ df . df 
AL= dt = 2 f dq b qb+ M> 

b 

produces no change in the equations of motion. The EL equations derived from L' 
are 

d dV dL' 
dt dq a dq a ' 

Writing V = L + AL, this becomes 

d dL dL d dAL dAL 
dtdq a dq a dt dq a dq a 
These will be identical to the EL equations derived from L if and only if 

d dAL dAL _ () 
dt dq a dq a 

Let us verify that this equation is always satisfied. 
Because / does not depend on q a , we have that 

dAL d df . df 



dq a dq a dq b dt 

\ - df dq b 

^ dq b dq a 

df_ 

dqj 

because dqb/dq a is 1 when b = a and otherwise. For example, q\ depends only on 
qi and on no other variable, so that dqi/dqi = 1 while dq\/dq2 = dqi/dqs = • • • = 0. 
From this it follows that 

d dAL d df 

dt dq a dtdq a 

d 2 f . , a 2 / 



E 



h dq b dq a qb ' dtdq a 
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On the other hand, 



dAL d /V^ df . df 

2^ —lb 1 



dq a dq a d 1b dt 

b 

d 2 f . d 2 f 

^ dq a dq b qb dq a dt' 

and this is equal to the previous result, because the order in which one evaluates 
second partial derivatives does not matter. We conclude that 

d dAL dAL 



dt dq a dq a 

and that the change of Lagrangian has no effect on the EL equations. The equations 
of motion that derive from L 1 = L + df/dt are indeed identical to the equations that 
derive from L. 

There is a more elegant way to prove this result. In this alternative derivation 
we appeal directly to Hamilton's principle. The action S' = J t 1 L' dt associated 

with L', and the action S — Ldt associated with L, are related by 

ft! 

S' = S+ ALdt 

Jta 



/ 

= S + /( go (ti),ti)-/(g„(t ),to). 



The equations of motion are obtained from S' or S by varying the paths q a (t) and 
demanding that the variation of the action be zero to first order in the variations 
Sq a (t). The variations, you may recall, must be subjected to the boundary condi- 
tions Sq a (t ) = 5q a {t\) = 0; the paths must all begin at the same q a (to) and end at 
the same q a (ti). But under these conditions we find that the values f(q a {to),to) aud 
f(q a (ti),t\) can never change under a variation of the paths, and we must conclude 
that 

SS' = 8S. 

An extremum of S will also be an extremum of 5", and the equations of motion 
derived from L and V are guaranteed to be the same. 

While the operation L — > L' = L + df/dt does not affect the equations of motion, 
it may nevertheless change the expressions for the generalized momenta p a and the 
total energy h. The new momenta p' a are given by 

, _ dV _ dL dAL 
Pa dq a dq a dq a ' 

or 

df 

according to our previous computations. The new energy function h! is given by 

h ' = EaPa<ia- L', SO 



, df . df . df 

a b 
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The sums cancel each other out, and we are left with 

h' = h-%. (2.5.8) 
dt v ; 

We find that the expression for the energy is affected only when / depends explicitly 
on time. 



2.6 Charged particle in an electromagnetic field 

The Lagrangian formulation of mechanics is well suited to mechanical systems for 
which the forces can all be derived from a potential-energy function V(q a ); these 
forces will depend on the positions q a , but that they might also depend on the 
velocities q a is normally out of the question. There is, however, an important 
mechanical system for which the forces do depend on velocity: a charged particle 
moving in the presence of an electromagnetic field. In this case the particle is 
subjected to the Lorentz force, and the equations of motion are 

ma = q(E + v x B). (2.6.1) 

Can this equation be derived on the basis of a Lagrangian? 

The answer is in the affirmative. An interesting property of this Lagrangian 
is that it depends on the scalar potential $ and vector potential A instead of 
depending on the fields E and B. Recall that the fields can be expressed in terms 
of the potentials as 

8A 

E = — — -V$, B = VxA. (2.6.2) 

dt K ' 

The potentials are usually introduced to simplify the structure of Maxwell's equa- 
tions. The definition of E implies 

d 

V x E = --V x A - V x (V$); 
dt 

since the curl of a gradient is always zero, this gives 

„ „ dB 

one of the four Maxwell equations. Similarly, the definition of B implies 

V B = V-(Vx A); 
since the divergence of a curl is always zero, this gives 

V-B = 0, 

another one of the Maxwell equations. The remaining two equations can then be 
recast into equations that $ and A must satisfy. 

It is convenient to express the fields in terms of the potentials, but it is important 
to understand that the potentials do not have direct physical meaning. Indeed, it 
is even possible to change the potentials by a certain transformation and leave the 
fields unaffected. This transformation is given by 

$^$' = $_|^ A^ A' = A + Vf, (2.6.3) 

where f(r, t) is an arbitrary function of position and time. Such a transformation 
of the potentials is known as a gauge transformation, and its defining property is 
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that the transformation leaves the fields invariant. Different sets of potentials that 
are related by a gauge transformation describe the same fields and therefore the 
same physical situation. 

Exercise 2.17. Show that the transformation of Eq. (2.6.3) leaves the fields unaffected. 
That is, show that the transformation produces E — > E' = E and B — > B' = B . 

The Lagrangian for a particle of charge q in an electromagnetic field is 

L = ^mv 2 -q$ + qA-v, (2.6.4) 

where v 2 = v ■ v. As stated previously, this depends on the potentials $ and A 
instead of the fields E and B. Another interesting property of the Lagrangian is 
that the potential-energy term V = q$> — qA ■ v depends on the velocity vector v 
as well as the position r. The dependence on position, of course, comes from the 
potentials, which may also depend explicitly on time. 

Let us verify that the Lagrangian of Eq. (2.6.4) docs indeed give rise, via the 
EL equations, to the Lorentz-force equation of Eq. (2.6.1). ft will suffice to verify 
the x component of the equation, which we write as 

m'x = qE x + q(v x B) x = qE x + q(yB z - zB y ). 

Similar computations would allow us to verify also the y and z components, but we 
will not present these here. 

We begin by presenting the Lagrangian in a more explicit form, as 

L = ^m(x 2 + y 2 + z 2 ) - q$ + q(xA x + yA y + zA z ). 

We have 

— = mx + qA Xl 
ox 



and this implies 

d dL _ .. (dA x . dA x . dA x . dA x 
dt dx mX ^ \ dx X dy ^ dz dt 

In this step we took into account the fact that A x depends on time through its 
dependence on the coordinates x(t), y(t), and z(t), and also through its own explicit 
dependence on t; the total time derivative had to be evaluated by using the chain 
rule. Finally, we have 

dL 9$ / dA T . dA, dA 7 



dx ^ dx ^\ dx dx Z dx 



The EL equations are 



() _ (d_A^ a$\ J dA * dA *\ -( dA v dA A J— — 

^\dt dx J ^ \ dx dx J dx dy ) ^ \ dz dx 

or 

dA x 9$\ . fdA y dA x \ . ( dA x dA z 



\ dt dx J \ dx dy J \ dz dx 

In the first set of brackets we recognize E x , in the second B z , and in the third B y . 
We therefore have 

mi = qE x + q(yB z - zB y ) = q(E + v x B) x , 
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and we have recovered the x component of the Lorentz-force equation, as required. 

Exercise 2.18. Make sure that you can also recover the y and z components of 
Eq. (2.6.1). 

In the course of this computation we came across the result 

— = rax + qA x . 
ox 

The left-hand side, we recall, is what was defined in Sec. 2.5.1 as the generalized 
momentum p x associated with the coordinate x. Generalizing, we find that 

p = mv + qA (2.6.5) 

is the generalized momentum vector of a charged particle. It contains a direct 
contribution mv from the particle and an additional contribution qA from the 
electromagnetic field. 

The energy function ft, of a charged particle is given by the general expression 
of Eq. (2.5.4), h = EaPaQa - L. We have 

h = p ■ v — L 

= (m,v + qA) ■ v — ^mv 2 + q§ — qA ■ v, 

or 

h= -mv 2 +q<Z>. (2.6.6) 

It is interesting to see that the terms containing A have canceled each other out; 
the energy function includes only the scalar potential <f>. 

We might ask how L, p, and h change under a gauge transformation. This 
is easily worked out. If we change the potentials from ($, A) to ($', A') using 
Eq. (2.6.3) we find that the Lagrangian becomes 

L' = ^mv 2 - q& + qA' ■ v 
1 2 9f 



2 -rm/ - q\$ - -± ) + q{A + V/) • v 



In other words, 



L' = L + qf f (2.6.7) 

Because the two Lagrangians differ by the total time derivative of a function qf(r, t) , 
the equations of motion derived from L' and L will be identical (refer back to 
Sec. 2.5.3). And because the equations of motion involve the gauge-invariant fields 
E and £?, this conclusion should not come as a surprise. 

Under a gauge transformation the generalized momentum vector becomes 

p' = mv + qA' 

= mv + q(A + Vf), 



so that 



p' = p + qVf; 



(2.6.8) 



2.7 Motion in a rotating reference frame 



89 



the momenta are not gauge invariant. The energy function, on the other hand, 
becomes 

h' = ^mv 2 + q& 

so that 

h' = h-q^; (2.6.9) 
the energy function also is not invariant under a gauge transformation. 

2.7 Motion in a rotating reference frame 

It was mentioned previously that in Lagrangian mechanics, the generalized coor- 
dinates q a are entirely arbitrary, and that in particular they do not have to be 
attached to an inertial frame. (For example, noninertial coordinates were employed 
in Sec. 2.4.5.) A consequence of this fact is that the Lagrangian methods can greatly 
facilitate the description of a mechanical system viewed in a reference frame that is 
not inertial. In this section we examine the motion of particles as viewed in rotating 
frames. We shall first consider the simple case of a particle moving on a turntable, 
and we shall next consider the more interesting case of a reference frame attached 
to a rotating Earth. 



2.7.1 Motion on a turntable 

We consider performing mechanical experiments on particles that move on, or above, 
a turntable that is rotating with a uniform angular velocity ST Our instruments 
are attached to the turntable, and we wish to analyze the motion of the particles 
as measured in the rotating frame. This frame, of course, is not an inertial frame, 
but the methods of Lagrangian mechanics can nevertheless be applied. 

We denote by S' the original inertial frame, and we let (x 1 , y', z') be its associated 
system of Cartesian coordinates; the primes indicate that we will not, ultimately, 
describe the motion of our particles in this coordinate system. We denote by S the 
rotating frame of the turntable, and its associated system of Cartesian coordinates 
is (x, y, z). The turntable is placed in the x'-y' plane, and it is rotating around the 
z' axis, which coincides with the z axis of the rotating frame. As shown in Fig. 2.17, 
the angle between the x and x' axes is Qt; this is also the angle between the y and 
y' axes. 

To work out the relationship between the coordinate systems we use, as a tool, 
the spherical coordinates (r, (?,</>) and (r', &',(/>') assigned to an arbitrary point P. 
Because the frames S and S' share the same origin, we have in fact that r' — r. And 
because they share also the same z axis, we also have 9' — 9. The angles <p' and <j) 
differ, however, and Fig. 2.17 makes it clear that they are related by (p' = <j> + Sit. 
We have 

x = r sin 6 cos (f>, y — r sm9 sincj), z — r cos 9 

and 

x' = r sin 9 cos </>' ', y' = r sin 9 sin 4>\ z' = r cos 9. 

The relationship is obtained by substituting (j)' = <fi + Clt into the previous expres- 
sions. It is a bit more efficient to first construct the complex combinations 



x + iy = r sin #(cos cf> + i sin 4>) — r sin 9 e 1 ' 
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z = z 




Figure 2.17: The rotating frame S of the turntable, as viewed in the inertial frame S' . 
A point P is referred to the inertial frame by its Cartesian coordinates (x',y',z') or its 
spherical coordinates (r, 6, cj>' = <j>+Qt). It is referred to the rotating frame by its Cartesian 
coordinates (x,y,z) or its spherical coordinates (r, #,</>). 



and 



Then we have 



or 



x' + iy' — r sin 6>(cos (/)' + i sin cj)') = r sin 9 e l< ^ . 
x' + iy' = rsin6»e 4W+ot) = e <nt rsin0e^, 
x' + iy' = e mt (x + iy). 



When fully expanded, this is 

x' = x cos Qt — y sin fit, y = y cos fit + x sin fit, z = z. 



(2.7.1) 
(2.7.2) 



Exercise 2.19. Verify that Eq. (2.7.2) follows from Eq. (2.7.1). Then work out the 
inverse transformation, (x',y',z') — > (x,y,z). 

A particle moving in the rotating frame S with a position vector r(t) = [x(t), y(t), z{t)\ 
moves in the inertial frame S' with a position vector r'(t) — [x'it), y'{t), z'(t)]; these 
are related by the transformation of Eq. (2.7.2). The components of the velocity 
vectors are then related by 

x = x cos fit — y sin fit — fl(x sin fit + y cos fit), 
y' = y cos fit + x sin fit — fl(y sin fit — x cos fit), 

z' = z. 

After a fairly laborious calculation, we find that the squared velocity, as measured 
in the inertial frame, is 

v = x + y + y 

= x 2 + y 2 + z 2 ~2fl(yx- xy) + fl 2 (x 2 + y 2 ). (2.7.3) 
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The particle's kinetic energy is then T — ^mv . It contains a contribution from r, 
the particle's velocity vector as measured in the rotating frame, and contributions 
from the rotational motion of the frame (the terms that involve ft). 

Exercise 2.20. Verify Eq. (2.7.3). You will save yourself some work if you use the trick 
of forming complex combinations. 

The particle's potential energy can be expressed in terms of the inertial coordi- 
nates (x',y',z'), but after the transformation of Eq. (2.7.2) it becomes a function 
of the rotating coordinates (x, y, z). Denoting this function V(x, y, z), we find that 
the particle's Lagrangian is 

L = ^m(x 2 + y 2 + z 2 ) - mtt{yx - xy) + \^mVL 2 {x 2 + y 2 ) - V(x, y, z). (2.7 '.4) 

The equations of motion for the particle are then obtained by substituting this into 
the EL equations for x, y, and z. 

The computations that lead to the equations of motion will be left as an exercise 
for the reader. We find 

m'x = -— — + 2mftw + mft 2 x, (2.7.5) 
ox 

my = -— 2mftx + mftV (2.7.6) 

dy 

dV 

mz = -_. (2.7.7) 

These equations can be expressed in vectorial form if we introduce the angular- 
velocity vector ft, defined by 

ft = ftz = [0,0, ft]. (2.7.8) 

The vectorial form is 

WIV = -^applied ~t~ -^Coriolis ~t" -^centrifugal; (2.7.9) 

where 

^applied = -W (2.7.10) 

is the true applied force on the particle, given by the gradient of the potential energy, 
while 

^Corioiis = 2mr x Q. = [2mfty, -2mflx, 0] (2.7.11) 

and 

^centrifugal = mfl x (r x fl) = [mfl 2 x, mtfy, 0] (2.7.12) 

are fictitious forces that arise because the reference frame S is not an inertial frame 
(refer back to the discussion of Sec. 1.1). The Coriolis force is linear in the angular 
velocity ft, and it depends on the particle's velocity vector r; its effect on the 
particle depends on its state of motion. The centrifugal force is quadratic in ft, and 
it depends only on the position vector r; this is always an outward force that points 
away from the centre of motion. 



Exercise 2.21. Verify that Eqs. (2.7.5)-(2.7.7) follow from the Lagrangian of Eq. (2.7.4). 



Exercise 2.22. Show that Eqs. (2.7.5)-(2.7.7) are equivalent to the vectorial equation 
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(2.7.9), together with the definitions of Eqs. (2.7.10)-(2.7.12). 



2.7.2 Case study #1: Particle attached to a spring 

As our first application of the rotating-frame formalism we examine a particle at- 
tached to a linear spring that is free to rotate around the z axis. The particle is 
thus subjected to the potential V — \k(x 2 + y 2 ), which we write in the alternative 
form 

V=^muj 2 (x 2 + y 2 ), (2.7.13) 

in which to 2 = k/m is a stand-in for the spring constant k. For simplicity we assume 
that the particle is confined to the x-y plane and we set, accordingly, z = z = in 
all equations. 

The equations of motion of Eqs. (2.7.5) and (2.7.6) become 

x = 20y+ (Vt 2 - lo 2 )x, y = -2Vlx + (O 2 - uj 2 )y. (2.7.14) 

Let us first analyze these equations in the limit of no rotation, O = 0. In this case 
they reduce to x = —uj 2 x and y = —uj 2 y, the equations of simple harmonic motion. 
The general solution to the equations of motion is then 

xn=o(t) — acos(ujt + a), yn=o(t) — bcos(ut + (3). (2.7.15) 

The four constants a, 6, a, and (5 can be related to the four initial values x(0), x(0), 
y(0), and ?/(0). Equations (2.7.5) are parametric equations for the motion of the 
particle in the x-y plane, and it is easy to show that the trajectory is elliptical. 

To analyze the equations in the general case (Cl ^ 0) we once more employ the 
clever trick of forming complex combinations. We introduce £ = x + iy and we 
combine the two equations (2.7.14) into a single equation for £: 

£ = x + iy 

= 2Q(y - ix) + (n 2 - lo 2 )(x + iy) 
= -2ifl(x + iy) + (fl 2 -cj 2 )(x + iy), 

or 

£ + 2^-(ft 2 -w 2 )£ = 0. (2.7.16) 

To find solutions to this equation we use a trial expression of the form £ = ce lA *, 
where c and A are complex constants. Substitution into Eq. (2.7.16) produces a 
quadratic equation for A, 

A 2 + 2QX + (S! 2 - to 2 ) =0, 

which factorizes as 

(\ + n + uj)(\ + n-Lu) = o. 

The solutions, obviously, arc A = — (CI ± lo), and the general solution for £ is 
or 

where c\ and C2 are complex numbers. 

To help us understand what we have just found, we observe that if we let f2 go 
to zero, our general solution for £ becomes cie~ lu)t + c 2 e lLJt . This is the solution in 
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the limit of no rotation, and this must be equal to xq = q + iyn=o, which was given 
by Eq. (2.7.15) above. So we can write our general solution as 

C(t) = e- mt [x n =o(t) + i ya =o(t)] . (2.7.17) 

Written in full, this is 

x(t) =xn=o{t) cos fit + yn=o(t)smQ,t, y(t) — yn=o(t) cosfM - x n=0 (t) sinftt. 

(2.7.18) 

What is the meaning of these results? The answer is simple. Comparison with the 
transformations of Eqs. (2.7.1) and (2.7.2) shows that the motion of the particle on 
the turntable is a rotated version of the motion that would take place in an inertial 
frame; the rotation angle is here —Qt instead of +ilt. This is easy to understand: 
The motion XQ = o(t), yii=a(t) is what the particle would do in an inertial frame; 
because, however, we are measuring this motion in a rotating frame, we see a 
rotated version of the inertial motion. This conclusion is confirmed by substituting 
our solution of Eqs. (2.7.17), (2.7.18) into Eqs. (2.7.1), (2.7.2); the result is 

x' (t) = x n =a (t) , y'{t) =yn=o(t), 

as expected. The motion of the particle as measured in the rotating frame is shown 
in Fig. 2.18 for three selected values of fi. 



Exercise 2.23. Fill in the mathematical gaps that were left behind in the presentation 
of this subsection. 



2.7.3 Motion on a rotating Earth. Kinematics 

We wish to describe the motion of a mechanical system from the point of view of an 
observer attached to a point P on the surface of a rotating Earth. This will be done 
with the help of a Cartesian frame (x, y, z) whose origin will be at P, and which will 
rotate along with the Earth. We will construct this Cartesian coordinate system 
in two stages. In the first stage we will momentarily assume that the Earth does 
not, in fact, rotate around its polar axis; in the second stage we will incorporate 
the rotation. 

We first place a Cartesian frame (x',y',z') at the centre of the nonrotating 
Earth. Neglecting the Earth's motion around the Sun, we consider this to be an 
inertial frame. Our end goal in this subsection is to relate (x,y,z), the local frame 
at P, to the inertial frame (x' , y' , z'). Our first step toward this goal is to introduce 
the spherical coordinates (r' ,6' ,</>'), which arc related to the original Cartesian 
coordinates by 

x' = r' sin 9' cos </>', y' — r' sin 9' sin <j>' , z' — r 1 cos 9'. 

As shown in Fig. 2.19, our point P on the Earth's surface is at a distance r' = R 
from the centre, and its position on the sphere is determined by the colatitude 9' 
and the longitude <j>' . (The latitude A' is related to the colatitude by A' = \ — 9'; 
thus the colatitude of the equator is 90° while its latitude is 0°.) Because the Earth 
is not yet rotating the longitude of P is a fixed angle; when we later incorporate the 
rotation into the picture we will put <j>' — fit, with il denoting the Earth's angular 
velocity. 

The spherical coordinates come with a set of basis vectors (f , 8', (/>'). Following 
the discussion of Sec. 1.2, we derive that these vectors are related to the Cartesian 
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no rotation 
low angular velocity ■ 




no rotation 
medium angular velocity ■ 




no rotation 
high angular velocity ■ 




Figure 2.18: A particle attached to a linear spring is viewed from a rotating frame. The 
upper graph was generated with Q/lj = 0.3, the middle graph with Q/lu = 0.7, and the 
lower graph with 0,/ui = 1.3. In all cases the initial values were set to x(0) = 1, x(0) = 0, 
2/(0) = 0, and y(0) = 0.4. The elliptical motion of the particle, which takes place when 
£7 = for the same initial conditions, is also shown for comparison. 
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Figure 2.19: An inertial frame (x',y',z') attached to the centre of the Earth, and a 
point P on the surface, described by colatitude 8' and longitude <f)' . The position vector 
of P relative to the inertial frame is R. 

basis (x',y',z') by 

dr' 

f' = — = sin 9' cos tj)'x' + sin 9' sin cf>'y' + cos 9'z', (2.7.19) 
or 

1 dr' 

6' = -— - = cos 9' cos ch' x' + cos 9' sin cb'y' -sin 9' z', (2.7.20) 
r 69' 

1 dr' 

<t>' = ^ = - sin + cos (2.7.21) 
r sin £/' oqy 

Here r' is the position vector expressed in terms of the spherical coordinates, 

r' = r' sin 6' cos $ x' + r' sin 9' sin 4>' y 1 + r' cos 6' z! . (2.7.22) 

The position vector of the point P on the surface is 

R = R sin 9' cos <p' x' + R sin 9' sin <p' y' + R cos 9' z' . (2.7.23) 

The spherical coordinates are useful to specify the position of the laboratory on 
the Earth's surface, but they are not so useful to describe the motion of mechanical 
bodies that would take place in this laboratory. For this purpose we introduce 
another Cartesian frame (x, y, z) whose origin will be at P. The orientation of this 
frame will be set by the directions of the basis vectors (f',0', cf>'). Thus, the z axis 
will point away from the surface, and will be aligned in the direction of f'\ the x 
axis will point in the southern direction, and will be aligned in the direction of 8'; 



96 



Lagrangian mechanics 



and the y axis will point in the eastern direction, and will be aligned in the direction 
of 4>' . We therefore set 

x = 0', y = 4>', z = f'. (2.7.24) 

The situation is illustrated in Fig. 2.20. 

We denote by r the position vector of a particle located at a point Q near the 
surface of the Earth, relative to the surface point P to which our frame (x, y, z) is 
attached. As usual we will resolve this vector in the basis (x, y, z) and identify the 
components with the particle's coordinates. We have 

r = x x + yy + z z, 

and if we involve Eq. (2.7.24) we obtain 

r = xO' + y4>' + zf'. 

If we now substitute Eqs. (2.7.19)-(2.7.21) and rearrange what we get, we find 

r = (x cos 9' cos 4> — j/sin <f>' + z sin 9' cos 4>)x 

+ (x cos 9' sin 4> + y cos (f)' + z sin 9' sin <j)')y' 
+ (-x sin 6' + z cos 0')z' . 

The position vector of the particle relative to the centre of the Earth is R + r. 
According to Eq. (2.7.23) and our previous result, this is 

R + r = [a; cos #' cos 0' — ysin^' + (R + z) sin 6*' cos </>'] a;' 

+ [a; cos 9' sin </>' + y cos <j>' + (R + z) sin 9' sin 0'] y' 
+ [-x sin 9' + (R + z) cos 9'] z! . 

The components of this vector in the original Cartesian basis [x',y',z') are the 
original Cartesian coordinates (x',y',z') of the particle at Q. We have obtained, 
therefore, the transformation 

x' = x cos 9' cos <f>' - y sin <p' + (R + z) sinfl' cos <j>' , (2.7.25) 
y' = x cos 0' sin 0' + y cos <j)' + (R + z)sm0' sin <j)' , (2.7.26) 
z' = -xsm9' + {R + z) cos 9', (2.7.27) 

between the two systems of Cartesian coordinates. 

Our considerations so far have relied on the fiction of a nonrotating Earth. To 
finally incorporate its rotation into the picture we set <p' — Clt into Eqs. (2.7.25)- 
(2.7.27), with f2 denoting the Earth's angular velocity. We also, at the same time, 
fix the colatitude of our laboratory to 9' — a. The transformation between the local 
rotating frame (x, y, z) and the original inertial frame (x 1 , y' , z') is finally given by 

x' = x cos a cos ilt — y sin ilt + (R + z) sin a cos fit, (2.7.28) 
y' = rrcosasinfM + y cos fit + (R + z) sin a sin fW, (2.7.29) 
z = -x sin a + (R + z) cos a. (2.7.30) 

We recall that R is the Earth's radius, that the x direction points due south, that 
the y direction points due east, and that the z direction points up, away from the 
surface. 



Exercise 2.24. Fill in the mathematical gaps that were left behind in the presentation 
of this subsection. 
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Figure 2.20: The local Cartesian frame (x, y, z) at P. The x direction points south, the 
y direction points east, and the z direction points up. The position vector of a point Q 
relative to P is denoted r. 



2.7.4 Motion on a rotating Earth. Dynamics 

Having established the transformation of Eqs. (2.7.28)-(2.7.30) we may now turn 
to the task of describing the dynamics of a particle as viewed in the local rotating 
frame. The particle's coordinates (x, y, z) are changing with time and we let (x, y, z) 
denote the components of the velocity vector in the local rotating frame. In the 
inertial frame, according to Eqs. (2.7.28)-(2.7.30), the components of the velocity 
vector are 

x = x cos a cos fit — y sin fit + i sin a cos fit 

— CI \x cos a sin fit + y cos fit + (R + z) sin a sin fit] , 

y = x cos a sin fit + y cos fit + z sin a sin fit 

+ fl [x cos a cos fit — y sin fit + (R + z) sin a cos flt\ , 

z = — isina + icosa. 



A fairly laborious calculation then returns the squared velocity; we find 

v' 2 = x 2 + y 2 + z 2 + 2£l| [(i? + z)y — yz] sin a + [xy — yx] cosa| 

+ fl 2 {[x cos a + {R + z) sin a] 2 + y 2 }. (2.7.31) 

The particle's kinetic energy is T — \mv 12 . Again (as in Sec. 2.7.1) we see that 
the kinetic energy has a contribution from the particle's motion within the local 
rotating frame, and contributions from the motion of the frame; these depend on fl 
and the laboratory's colatitude a. 



Exercise 2.25. Verify Eq. (2.7.31). 



The particle's potential energy comes from two different sources, which we choose 
to distinguish in this subsection. The first is the Earth's gravity, and this contribu- 
tion to the potential energy is V^avity = mgz, as usual. The second contribution 
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comes from all the other forces acting on the particle; we write this as V^thor = U. 
The total potential energy is then 

V = mgz + U(x,y,z). (2.7.32) 

The particle's Lagrangian is, finally, 

L = ^m(x 2 + y 2 + z 2 ) + mO| [(R + z)y — yi] sin a + [xy — yx~\ cos a j 

+ ^™^ 2 { [x cos a + (R + z) sin a] 2 + y 2 | - mgz - U(x,y,z). (2.7.33) 

The equations of motion in the local rotating frame are obtained by substituting 
this into the EL equations. 

Omitting the detail of the calculations, the equations of motion are 

dU 

mx = — — h 2mfly cos a + mtt 2 [xcos a + (R + z) sin a] cos a, (2.7.34) 

dU 

my = — — 2mfl(xcosa + zs'ma) + mQ 2 y, (2.7.35) 

dy 

dU 

mz = —mg — h 2mVLysina + mfi 2 [icosa + (R + z) sin a] sin a. (2.7.36) 

oz 

These equations can be expressed in vectorial form if we introduce the angular- 
momentum vector n. This is given by l~i = Viz 1 , and in the local rotating frame we 
have the components fl x — fi • x = ilz' ■ 0' = — O sin a, Cl y — ft ■ y — Viz' ■ <j>' = 0, 
and fl z = CI ■ £ = ilz' ■ f' = Q cos a. The vector is therefore given by 

Jl = — ilsmax + Cl cos a z — [—0 sin a, 0, f2 cos a] (2.7.37) 

in the local rotating frame. We also re-express the vector R of Eq. (2.7.23) as 

R = Rz = [0,0, i?]; (2.7.38) 

this gives the position of the laboratory relative to the Earth's centre. 
The vectorial equation is 

mr = mg + F app ii od + F Co riolis + ^centrifugal, (2.7.39) 

where g = —gz, = [0, 0, — g] is the acceleration of gravity, -F app iicd = — VC/ is the 
net force coming from all other interactions, 

-Fborioiis - 2mr x Cl (2.7.40) 

is the Coriolis force, and 

Centrifugal = mfl x [(R + r) x «] (2.7.41) 

is the centrifugal force. In Eq. (2.7.39), mg and -F app n c d are genuine forces acting 
on the particle, while Fcorioiis and Centrifugal are fictitious forces that arise from 
the rotational motion of the frame. 

Exercise 2.26. Verify that Eqs. (2.7.34)-(2.7.36) follow from the Lagrangian of 
Eq. (2.7.33). 



Exercise 2.27. Show that Eqs. (2.7.34)-(2.7.36) are equivalent to the vectorial equation 
(2.7.39), together with the definitions of Eqs. (2.7.37), (2.7.38), (2.7.40), and (2.7.41). 
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2. 7. 5 Case study #2: Particle released from rest 

To gain some insight into the effects of Earth's rotation we shall determine what 
happens to a particle that is released from rest at a great height h in the Earth's 
gravitational field. Because the Earth's rotation is quite slow, it will be sufficient to 
work consistently to first order in the angular velocity f2. We will therefore neglect 
all terms of order S7 2 in the equations of motion; this means that we will keep the 
Coriolis term, but discard the centrifugal term. 

At this level of accuracy the equations of motion (2.7.34)-(2.7.36) reduce to 

x = 2flycosa, y = — 2f2(xcosai + isma), z = — g + 2Cly sin a. (2.7.42) 

We impose the initial conditions x(0) = y(0) = 0, z(0) = h, as well as x(0) = y(0) = 
i(0) = 0. 

We shall solve Eqs. (2.7.42) by the method of successive approximations. We 
first express the particle's coordinates x(t), y(t), and z(t) as formal expansions in 
powers of Q. Thus, 

x(t) = x {t)+n Xl (t)+- ■ • , y(t) = y (t) + n yi (t) + - ■ • , z{t) = z (t)+Slz 1 (t) + - ■ ■ . 

In terms of these new quantities the initial conditions become Xo(0) = yo(0) = 0, 
zo(0) - h, Xl (0) = i/i (0) - zi(0) - 0, as well as x Q (0) = y (0) = i (0) = ii(0) - 
yi(0) = ii(0) = 0. Substituting the expansions into Eq. (2.7.42) yields 

x + Q,X\ + ■ ■ ■ = 2O(y + • • •) cos a, 

yo + Qy\ H = -2f2(x cosa + z sina H ), 

z Q + ttzi H = -g + 2f2(j/o H ) sin a. 

Equating powers of produces the set of equations 

xo = 0, y = 0, z Q = -g, 

xi = 2yo cos a, j/i = — 2xo cosa — 2io sina, 'i\ — 2yo sin a. 

The zeroth-order equations are easy to solve. In view of the initial conditions, the 
solutions are 

x (t) = 0, y O (t) = 0, z (t) = h-^gt 2 . 

With x = yo = and z = —gt the first-order equations become 

Xi =0, y\ = 2gtsma, 'i\ = 0. 

These equations also are easy to solve. Taking once more the initial conditions into 
account, we find that the solutions are 

Xl {t) = 0, y x {t) = ^i 3 sina, Zl {t) = 0. 

The complete solution to the equations of motion is therefore 

x(t) = + O(n 2 ), (2.7.43) 
y(t) = h gVt sin a)t 3 + O (SI 2 ), (2.7.44) 

z(t) = h-^gt 2 + 0(n 2 ). (2.7.45) 

Observe that the factor gQ sin a is positive for any colatitude a, except at the North 
and South poles where it is zero. The fact that y increases during the motion means 
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Figure 2.21: Geometry of Foucault's pendulum. The motion of the pendulum is de- 
scribed by the swing angle 9 and the rotation angle <j). 

that the particle, which would just fall straight down in the absence of rotation, is 
in fact drifting away in the eastward direction. 

The time required for the particle to hit the ground is t = (2h/g) 1 ^ 2 , and by 
this time to total eastward displacement is 



If the object is released from a height of 100 m at Guelph's colatitude (approximately 
47°), this amounts to approximately 1.6 cm. The Coriolis force produces a rather 
small effect. 



Exercise 2.28. Compute this number. 



A dramatic demonstration of Earth's rotation came from Foucault's celebrated pen- 
dulum, which was first displayed in front of an audience at the Observatoire de Paris 
in 1851. The idea is that while the Earth rotates the pendulum keeps oscillating 
in a fixed plane as viewed from an inertial frame; as seen from the Earth's rotating 
frame, however, it is the pendulum that appears to be rotating. More precisely 
stated, as viewed in the local rotating frame the pendulum is swinging in a plane 
which rotates at a steady rate f2 p i an c; this is directly related to fi, the rate at which 
the Earth itself is rotating. 

In this last application we will examine the motion of a pendulum in the local 
rotating frame. We aim to calculate f2 p i a ncj m an approximation in which we neglect 
the centrifugal effects (which are proportional to ft 2 ) but retain the Coriolis effects 
(which are proportional to f2), and in an approximation in which the amplitude of 
the pendulum's oscillations is assumed to be small. 

In the spirit of Lagrangian mechanics we will use the generalized coordinates 9 
and 4> to describe the motion of the pendulum, as illustrated in Fig. 2.21. (Notice 
that 9, as defined here, is not the standard spherical coordinate.) The relation 




(2.7.46) 
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between these generalized coordinates and the original Cartesian system (x, y, z) is 
given by 

x = £ sin 9 cos <f>, y = £ sin 9 sin (f>, z~h — £ cos 9. (2.7.47) 

The pendulum has a constant length £, and its pivot point is attached at a height 
h above the Earth's surface. As usual we introduce the quantity 

cu 2 - g/t, (2.7.48) 

and we will use it instead of g. 

The two degrees of freedom of the pendulum are represented by the angles 9 
and <j>. The first angle, 9, is the usual swing angle of the pendulum. In the absence 
of rotation, the pendulum would swing in a fixed plane, and <j> would stay constant; 
in this situation there would be a single degree of freedom. But as we shall see, 
the Earth's rotation will force 4> to change steadily with time, and the plane of the 
pendulum will rotate in the x-y plane; in this situation there are two degrees of 
freedom, and 4> is the rotation angle of the swing plane. 

According to Eq. (2.7.47) the pendulum's velocity vector has the components 

x = £ cos 6 cos (f) 9 — £ sin 9 sin </> <fi, 
y = £ cos 9 sin <f> 9 + £ sin 9 cos <f> <j>, 
z = £s\n99. 

It follows that the squared velocity is 

v 2 = £ 2 9 2 + £ 2 sm 2 9<f ) 2 . 

We make this substitution, along with z — h — £cos9, into the Lagrangian of 
Eq. (2.7.33), and we allow ourselves (once more) to neglect the centrifugal terms 
that are proportional to fi 2 . This yields 

L = X -m,£ 2 {9 2 + sin 2 9 j> 2 ) 

+ mO| [(R + h — £cos 9)y — yz] sin a + [xy — yx] cosa j 
— mg(h — £cos9). 

In this Lagrangian we recognize a term mQ(R + h)ysma and another term —mgh 
that can both be discarded. We can omit the first term because it is the time 
derivative of the function mCl(R + h)ysmot, and as we have learned in Sec. 2.5.3, 
such a term will not contribute to the equations of motion. And we can omit the 
second term for the simple reason that it is a constant; it will also contribute nothing 
to the equations of motion. 

The Lagrangian simplifies to 

L = ^m£ 2 (9 2 + sin 2 9 <j) 2 ) 



mflj— [£cos9y + yz] sina + [xy — yx] cosaj 
mg£ cos 9. 



This becomes 



L = m£ 2 l^(9 2 + sin 2 9 <j) 2 ) - ft sin a (sin </> + sin 6> cos 6> cos </> 

+ ncosasin 2 9(j> + Lj 2 cos9\, (2.7.49) 
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after involving the transformation between the Cartesian coordinates (x, y, z) and 
our generalized coordinates (6, <j>). This is the pendulum's Lagrangian, up to terms 
of order ft 2 that have been neglected. 



Exercise 2.29. Verify Eq. (2.7.49). 



We may simplify the Lagrangian further if we assume that the amplitude of the 
pendulum's oscillations is sufficiently small that we can use the approximations 

sin0~0, costf ~ 1 - ^9 2 . 
Neglecting all terms of order 9 3 and higher, we obtain 

L = m( 2 \^{9 2 + 9 2 4> 2 ) -ft sin a (sin 00 + (9 cos 
+ il cos a 9 2 6 + uj 2 ( 1 ! 



2 

In this simplified Lagrangian we recognize a term proportional to 

d (, 



sin + 6 cos <ft — — (o sin 0^ : 



we can discard this term from the Lagrangian because this is a total time derivative. 
We may also remove the constant term uj 2 . After these simplifications, our final 
Lagrangian will be 

L = m£ 2 \^{8 2 + 8 2 4> 2 ) + ftcosa# 2 0- ^ 2 2 j. (2.7.50) 

This is the simplified Lagrangian for Foucault's pendulum, and it is valid in the 
limit of small swing angles. 

The equations of motion are obtained by substituting the Lagrangian of Eq. (2.7.50) 
into the EL equations. Omitting all details, we find that the equation for 9 is 

9 + lu 2 9 - 0(0 + 2ft cos a)6 = 0. (2.7.51) 

And the fact that the Lagrangian does not depend explicitly on implies that the 
(rescaled) generalized momentum 

P(f> = 6» 2 (0 + ftcosa) (2.7.52) 

is a constant of the motion. 



Exercise 2.30. Verify that the EL equations produce Eqs. (2.7.51) and the statement 
that p^, as defined by Eq. (2.7.52), is constant. 



Solving Eq. (2.7.52) for gives = — ft cos a+p^/9 2 . We see that unless = 0, 
would blow up as 9 — > 0, that is, whenever the pendulum crosses the z axis. To 
eliminate this unphysical behaviour we set — 0, so that 

0= -ft cos a. (2.7.53) 

This is our key result. In the absence of rotation we would find that = 0, and 
we would conclude that the pendulum swings in a fixed plane, as we had foreseen 
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at the beginning of this subsection. With the Earth's rotation, however, we find 
instead that <p — — ficosa, and this means that the swing plane is rotating with a 
constant angular velocity given by 

^plane = -H cos a. (2.7.54) 

This is the Foucault effect. 

When the pendulum is located in the northern hemisphere, we have that cos a > 
and we find that fipianc < 0, so that the swing plane rotates clockwise. When, 
on the other hand, we go to the southern hemisphere, we have that cos a < and 
f^piane > 0, so that the swing plane rotates counterclockwise. The Foucault effect 
is maximum at the poles, and it vanishes at the equator. 



Exercise 2.31. Calculate A<j>, the angular displacement of the swing plane after 1 
hour, when the Foucault pendulum is located in Guelph (colatitude 47°). 



Exercise 2.32. This is the laboratory component of the course. There is a Foucault 
pendulum in the foyer of the MacNaughton building, and you are asked to determine its 
value for A(j>. Measure the angular position of the swing plane when you first arrive in 
the Department of Physics, and record the time. Repeat the measurement when you are 
about to leave. Divide the difference in angular positions by the time interval measured in 
hours, and obtain your experimental value for A<f). How close is it to the theoretical value 
obtained in the previous exercise? 



To finish off our discussion of the Foucault pendulum we return to Eq. (2.7.51), 
in which we substitute Eq. (2.7.53). The result is 

e + Lo 2 6 - (-ftcosa)(+ftcosa)6> = 0, 

or 

9 + (lo 2 + n 2 cos 2 a)9 = 0. 

This is the equation for simple harmonic motion, and it appears to indicate that 
the natural frequency of the pendulum is shifted from w to \Jlo 2 + fi 2 cos 2 a by 
the Earth's rotation. This conclusion, however, is premature. In the course of our 
calculations we have consistently neglected all terms of order ft 2 , starting with the 
Lagrangian of Eq. (2.7.49). We must continue to do so, and the previous equation 
must be approximated by 

9 + uj 2 e = 0. (2.7.55) 

This is still the equation for simple harmonic motion, with the original natural 
frequency lu = y/g/t. The general solution to this equation is 6(t) = 9 a cos(ujt + 5), 
where 9 and 5 are constants. 

To sum up, we have found that to first order in f2, the pendulum swings as a 
simple harmonic oscillator, but that it does so in a plane that rotates around the 
vertical direction with an angular velocity f2 p i a nc = — ^ cos a. 
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1. (a) Find the curve y(x) that passes through the endpoints (0,0) and (1,1) 
and minimizes the functional 



/dy\ 2 
\dx) 



dx. 
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(b) What is the minimum value of the functional? 

(c) Evaluate I[y] for a straight line that passes through the same two end- 
points. Is this smaller or larger than your answer in part (b)? 

2. You are mounting an expedition to reach the other side of a volcano, and 
you wish to determine the path that will minimize the distance traveled. To 
perform this calculation you decide to use cylindrical coordinates (p, <j>, z) and 
you model the volcano as the conical surface z = 1 — p. [The cylindrical 
coordinates are defined by x — pcos<f>, y — psin^, and z = z.] You describe 
the path by the function p(<ft) and let the angle <j> range through the interval 
— f < 4> < f ■ The starting point of the expedition is (p = 1, <j> = — |, z = 0) 
and the end point is (p = 1, <f> = +|, z = 0). You wish to find the path p((j>) 
that minimizes the total distance traveled from this side of the volcano to the 
other side. 

(a) Prove that the functional that must be minimized is 



where p' = dp/dcj>. 

(b) Find the differential equation that the minimal path must satisfy. 

(c) Show that 



is a solution to this differential equation, and that it satisfies the bound- 
ary conditions; conclude that this must be the minimal path. Produce a 
plot of z = 1 — p as a function of <p. 

(d) Calculate the minimum distance s m i n . Compare this with the distance 
that would be traveled if the path were instead chosen to be p((f>) = 1. 

3. A bead of mass m slides on a frictionless wire that is shaped in the form of a 
cycloid. This is described by the parametric equations 



where a is a constant and the parameter 9 ranges through the interval < 
9 < 2-7T. The bead is subjected to gravity, and it oscillates back and forth on 
the wire. 

(a) Using 9 as a generalized coordinate, calculate the bead's Lagrangian. 

(b) Show that the equation of motion for the bead is 




i 




x = a(9 — sin 9) 



y = a(l + cos 9), 



2(1 -cos 9)9 + sin 9 9 2 - - sin 61 = 0. 



(c) 



Show that the transformation u = cos(^9) brings this equation to the 
much simpler form 

u + lu 2 u = 0, 



(d) 



and find an expression for u>. 

What is the period of the bead's oscillations? 
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4. A particle of mass m moves on a paraboloid of revolution described by the 
equation 



where a is a constant (see the figure). The particle is subjected to gravity, 
so that its potential energy is V = mgz. Using the cylindrical coordinates p 
and <p as generalized coordinates, find the Lagrangian of the particle. [The 
cylindrical coordinates are defined by x = pcos0, y = psincf).] 



5. A straight frictionless wire is attached at a height h to the z axis, and it makes 
an angle a relative to the z axis. The wire rotates around the z axis with 
a constant angular velocity tt. A bead of mass m slides on the wire and is 
subjected to gravity; it is at a distance r from the point at which the wire is 
attached to the z axis (see the figure). 



(a) Using r as a generalized coordinate, calculate the bead's Lagrangian. 

(b) Obtain the equation of motion for the bead. 

(c) Solve the equation of motion, assuming that the bead starts from rest at 
the point of attachment; this means that r(t = 0) = and r(t = 0) = 0. 
Show that your solution can be expressed in the form 





X 



y 



r(t) 



g cos a ' 



cosh(fct) — f , 



fc 2 L 



where k 



= sin a. 
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6. In Sec. 2.4.5 we examined the motion of a planar pendulum whose pivot point 
was forced to rotate with a constant angular velocity f2. Here we consider 
instead a planar pendulum whose pivot point is forced to move horizontally 
with a constant acceleration a. This motion takes place in the x direction, 
and ^Cpivot = 2^ • 

(a) Using the swing angle 9 as a generalized coordinate, find the pendulum's 
Lagrangian. 

(b) Derive the equation of motion for the pendulum. 

(c) Show that the pendulum can be in an equilibrium state in which 9{t) = 
9 cq = constant. Show that the equilibrium position is determined by 

„ a 
tan Pea = — • 
q 9 

(d) Suppose that the pendulum oscillates about its equilibrium position, so 
that 9 — 9 cq + <f>, with <f> denoting the angular deviation away from 
equilibrium. Assuming that is a small angle, show that its behaviour 
is governed by an equation of the form 

'(j) + UJ 2 (j> = 0. 

Find an expression for lu 2 in terms of <?, a, and I. 

7. In this problem we examine the motion of the same pendulum as in the pre- 
ceding problem, but we now let the pivot point move vertically upward with 
a constant acceleration a. Find the pendulum's Lagrangian and derive its 
equation of motion. 

8. A particle of mass m is constrained to move on the surface of a cylinder. The 
cylinder is described in cylindrical coordinates by the equation p = R, where 
p is the distance from the z axis and R is the cylinder's radius. The particle is 
subjected to a force directed toward the origin of the coordinate system and 
proportional to the distance between the particle and the origin; this force is 
described by F = —kr, where k is a constant and r is the particle's position 
vector. 

(a) Using the cylindrical coordinates z and <f> as generalized coordinates, find 
the particle's Lagrangian. 
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(b) Derive the particle's equations of motion and find their general solutions. 

9. A particle of mass m and electric charge q moves in the presence of a vector 
potential 

A= ^B (-yx + xy), 

where B is a constant. 

(a) What is the magnetic field JB? 

(b) What is the particle's Lagrangian? 

(c) What are the particle's equations of motion? 

(d) What is the general solution to these equations? Describe how the par- 
ticle moves in this magnetic field. 

10. A particle of mass m and electric charge q moves in the presence of a vector 
potential 

A = — ^ sin[fc(z — ci)l x, 
kc 

where E is a constant, c is the speed of light, and k is another constant. 

(a) What are the electric field E and magnetic field Bl What kind of elec- 
tromagnetic field does this vector potential represent? 

(b) What is the particle's Lagrangian? 

(c) What are the particle's equations of motion? 

(d) Find the general solution to these equations in the nonrelativistic limit, 
in which x/c <1, i//c«l, and i/c <C 1. 

11. A plumb bob at rest near Earth's surface, in a laboratory at eolatitude a, is 
subjected to a force F = mg + F ccntr ifugai when viewed in a local rotating 
frame. We write this force as F — mg cS and define 

9eS = g + -^centrifugal/™ 

as the effective gravitational field felt by the plumb bob. Assuming that 
£l 2 R/g is a small number (which it is), calculate: 

(a) The fractional difference \g e e — g\/g between the magnitudes of the effec- 
tive and true gravitational fields; quote your result as a percentage. 

(b) The angle that g c e makes relative to the z direction; quote your result 
in degrees. 

Assume that the laboratory is situated in Guelph, at a eolatitude of 47°. 

12. A projectile is launched from Earth's surface with an initial velocity v(t = 
0) = V\X + v-iy + v 3 z. The launch pad is situated at the origin of a local 
rotating frame at eolatitude a, and the motion of the projectile is examined 
in this reference frame. Working consistently to first order in f2 (and therefore 
neglecting centrifugal effects), obtain the motion of the projectile for all times 
t. In other words, solve the equations of motion for the functions x(t), y(t), 
and z(t), incorporating the initial conditions x(0) = y(0) = z(0) = 0, as well 
as x(0) = V\, y(0) = v 2 , and i(0) = v 3 . 
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13. A projectile is launched directly upward from Earth's surface (so that v\ — 
v 2 = but V3 ^ 0). The launch pad is situated at the origin of a local 
rotating frame at colatitude a, and the motion of the projectile is examined 
in this reference frame. Working consistently to first order in fi, calculate the 
projectile's position when it finally hits the ground. Express your result in 
terms of g, Q,, a, as well as h, the maximum height reached by the projectile. 
In which direction is the projectile displaced? [This problem is a special case 
of the preceding problem. You may find it useful to solve the general problem 
first.] 

14. A cannonball is fired due east with an initial speed vq, and with an angle 6 with 
the horizontal. The cannon is placed at the origin of a local rotating frame at 
colatitude a, and the motion of the cannonball is examined in this reference 
frame. Working consistently to first order in fi, calculate the cannonball's 
lateral displacement when it finally hits the ground. In which direction is 
it displaced? Does this direction depend on whether the cannon is in the 
northern or southern hemisphere? [The same remark as in the preceding 
problem applies.] 

2.9 Additional problems 

1. Two particles of mass m\ and m2 are attached to a string of constant length 
t. The first particle moves on a frictionless table. The string goes through a 
hole in the middle of the table, and the second particle swings underneath the 
table. The first particle therefore moves in the x-y plane under the action of 
an attractive force directed toward the hole, and the second particle behaves 
as a planar pendulum with a variable distance to the pivot point. 

The first particle is at a distance r to the hole, and its position vector makes 
an angle \ relative to the x axis. The swing angle of the second particle 
(relative to the vertical) is denoted tp. 

Using r, X; an d V" as generalized coordinates, find the Lagrangian of this 
mechanical system. 

2. In an experiment designed to measure the Coriolis effect, a particle of mass m 
is set to move on a large frictionless table. The table is placed in a laboratory 
at colatitude a, and the table is oriented along the x and y directions (with 
x pointing south and y pointing east). The Earth's angular velocity is ft. 

At all times the particle moves on the table with z = 0. It begins its motion 
(when t = 0) at the origin x — y — of the reference frame. Initially it is 
heading due south, with a velocity v(t = 0) = v x as measured in the local 
rotating frame. At later times the particle is observed to move laterally, as 
predicted by the Coriolis effect. 

Calculate the lateral displacement y as a function of the forward displacement 
x. You must perform the calculation consistently to first order in Q, but you 
may neglect all terms of order £1 2 (and higher powers). 

Express your result for y in terms of x, fi, a, and v - 

3. A particle of mass m moves on the interior surface of a hollow hemisphere 
of radius a. The particle's position on the hemisphere is determined by the 
usual angles and <p. 

(a) Show that the particle's Lagrangian is 

L = ^-ma 2 (0 2 + sin 2 6 cf> 2 ) + mgacosO. 
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(b) Derive the equations of motion for the particle. Show that the equation 
for can be expressed in the form 

\e 2 + v{6)=e, 

and find an expression for the effective potential v{&). (The constant e 
is proportional to the particle's total mechanical energy.) 

(c) Provide a rough sketch of v{6). 

(d) Show that a possible solution to the equations of motion is 6{t) = 9 n = 
constant. 

(e) Calculate the particle's speed v when it follows the path described in part 

(d); express your result in terms of a, g, and 6q. 
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Chapter 3 
Hamiltonian mechanics 



3.1 From Lagrange to Hamilton 

As we saw in Chapter 2, the Lagrangian formulation of the laws of mechanics offers 
increased flexibility and efficiency relative to the Newtonian methods, and it is 
based on an appealing principle of least action. In this chapter we add a layer 
of mathematical sophistication to this formulation of mechanics. The resulting 
Hamiltonian formulation of the laws of mechanics gives this area of theoretical 
physics an aura of perfection that has probably not been surpassed by any other 
area of theoretical physics. 

The main goal of the Hamiltonian formulation is to displace the emphasis from 
the generalized velocities q a to the generalized momenta p a , and from the La- 
grangian L(q a ,q a ,t) to a new function H{q a ,p ai t) called the Hamiltonian func- 
tion of the mechanical system, which is numerically equal to the system's total 
mechanical energy. The motivation behind this shift of emphasis is clear: While 
the generalized velocities are rarely conserved quantities, the generalized momenta 
sometimes are, and while the Lagrangian is never conserved, the Hamiltonian usu- 
ally is. The Hamiltonian formulation therefore involves all the dynamical quantities 
that have a chance of being constants of the motion, and this constitutes a useful 
and interesting refinement of the original Lagrangian methods. 



3.1.1 Hamilton's canonical equations 

To see how the reformulation is accomplished, let us go back to Eq. (2.5.4), which 
gives the definition of the function h(q a ,q a ,t), which is also numerically equal to 
the total mechanical energy of the system. This is 

h(q a ,q a ,t) = ^Paia - L(q a ,q a ,t), (3.1.1) 

a 

where 

Pa=^~ (3.1.2) 

dq a 

is the generalized momentum associated with the generalized coordinate q a . Notice 
that here we allow L and h to depend explicitly on time. And notice that the energy 
function is denoted h, not H ; we will explain this distinction later. 
We construct the total differential of h: 



dh = ^2(q a dp a + Pa dq a ) - dL. 



To calculate dL we invoke the chain rule, and write 

,, x ^l dL , dL ,. \ dL , 
dL = T,{^+9q- a d ^)+9i dt - 
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Combining these results gives 

dL 



dh = J2 



dL\ J . 8L , 
q a dp a + [Pa - W7- dq a - -w—dq a 
oq a ) oq a 



8t dL 



From the definition of the generalized momentum we recognize that the coefficient 
of dq a is zero. And since the Euler-Lagrange (EL) equations can be expressed in 
the form p a = dL/dq a , what we have is 

dL 

dh = ^2(q a dp a - Pa dq a ) - — dt. (3.1.3) 

a 

Suppose now that h is given as a function of q a , p a , and t. Then it would follow 
as a matter of mathematical identity that the total differential of h(q a ,p a ,t) is 



„ / dh , dh , \ dh , 



Comparing this with Eq. (3.1.3) reveals that we can make the identifications 
. _ dh_ . _dh_ dL _ _dh 

Qa ~7\ ■) Pa 7{ ; "777 7TT • 

dp a dq a at at 

The first two equations are evolution equations for the dynamical variables q a (t) and 
p a (t). These are almost Hamilton's equations, except for one important subtlety. 

The previous identifications can be made if and only if the function h is expressed 
in terms of q a , p a , and t. If it is so expressed, then we have learned that q a is the 
partial derivative of h with respect to p a keeping q a constant, while p a is (minus) 
the partial derivative of h with respect to q a keeping p a constant. Our function 
h, however, has not yet been expressed in terms of the new variables; it is still 
expressed in terms of the old variables q a , q a , and t. Before we can write down 
Hamilton's equations we must solve for q a in terms of q a and p a , and we must make 
the substitution in h. We must therefore evaluate 

h(q a ,q a {q a ,p a ),t) = H(q a ,p a ,t), (3.1.4) 

and this is what we shall call the Hamiltonian function of the mechanical system. 
The functions h and H are numerically equal, they both represent the total me- 
chanical energy of the system, but only the Hamiltonian H is the required function 
of q a , p a , and t. 

Having clarified this point and made the change of variables from (q a ,Qa) to 
{q a ,Pa), we can finally write down Hamilton's equations, 

d H . dH 

qa = 15—, Pa = -TT~ ■ (3.1.5) 

dp a dq a 

This system of equations is formally equivalent to the original set of EL equations. 
But instead of representing a set of n second-order differential equations for the 
coordinates q a (t) — there is one equation for each of the n degrees of freedom — 
Hamilton's equations represent a set of 2n first-order differential equations for the 
new dynamical variables q a (t) and p a (t). The generalized momenta are now put on 
an equal footing with the generalized coordinates. 

The recipe to arrive at Hamilton's canonical equations goes as follows: 

1. Begin with the Lagrangian L(q a , q a , t) of the mechanical system, expressed in 
terms of any set of generalized coordinates q a and the corresponding general- 
ized velocities q a . 
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2. Construct the generalized momenta p a — dL/dq a , and solve for the general- 
ized velocities to obtain q a (q a ,p a ,t). 

3. Construct the Hamiltonian function 

H(q a ,p a ,t) = ^2p a q a - L 

a 

and express the result entirely in terms of q a , p a , and t; at this stage the 
generalized velocities have completely disappeared from sight. 

4. Formulate Hamilton's equations, 

. _ dH . dH 

qa — Tj ; Pa ^ 

OPa Oq a 

and solve the equations of motion for q a (t) and p a (t); observe that the general- 
ized velocities q a have reappeared, but now as a consequence of the dynamical 
equations. 

Concrete applications of this recipe will be given in the next section. For the 
time being we prefer to explore some of the formal consequences of Hamilton's 
formulation of the laws of mechanics. 

3.1.2 Conservation statements 

We begin with an examination of what Hamilton's equations have to say regarding 
the existence of constants of the motion. 

It follows immediately from the dynamical equation 

dH 

dq a 

that if the Hamiltonian H happens not to depend explicitly on one of the generalized 
coordinates, say then dH/dq* — and p\ = 0. This means that p* will be a 
constant of the motion, and we have established the theorem: 

Whenever the Hamiltonian of a mechanical system does not depend ex- 
plicitly on a generalized coordinate q* , the corresponding generalized 
momentum p» is a constant of the motion. 

We made a similar statement back in Sec. 2.5.1, but in terms of the Lagrangian 
instead of the Hamiltonian. 

Hamilton's equations allow us also to state another theorem, which is very sim- 
ilar: Whenever the Hamiltonian of a mechanical system does not depend explicitly 
on a generalized momentum p*, the corresponding generalized coordinate q* is a 
constant of the motion. This statement is a true consequence of Hamiltonian dy- 
namics, but it is less useful in practice: If g» were a constant of the motion it is 
likely that it would not have been selected as a coordinate in the first place! 

What do Hamilton's equations have to say about conservation of energy? To 
answer this let us consider a general Hamiltonian of the form H(q a ,p a ,t), which 
includes an explicit dependence on t. Its total time derivative is 

dH _s^fdH . dH_ . \ dH 

~dT ~ ^{dq~a qa+ dp~a Pa ) + ~dJ- 

By Hamilton's equations this becomes 

dH v-^, . . . . > dH 

-ft = l^i-PaQa + qaPa) + , 
a 
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Figure 3.1: A trajectory in a two-dimensional configuration space. It is possible for the 
trajectory to intersect itself, because the system can go back to the same position after a 
given interval of time. 

or 



This gives us the statement 

Whenever the Hamiltonian of a mechanical system does not depend 
explicitly on t, it is a constant of the motion: dH/dt = 0. 

Recall that back in Sec. 2.5.2 we derived the relation dh/dt = —dL/dt. Here we 
have instead dH/dt = dH/dt. These statements are compatible by virtue of the 
fact that dL/dt = —dH/dt; you will recall that we came across this identification 
back in Sec. 3.1.1. 

3.1.3 Phase space 

Suppose that a mechanical system possesses n degrees of freedom represented by 
n generalized coordinates q a (with a = 1, 2, • • • , n labeling each one of the n co- 
ordinates, as usual). The n-dimensional space spanned by the g a 's is called the 
configuration space of the mechanical system. The motion of the entire system can 
be represented by a trajectory in configuration space, and the generalized velocities 
q a represent the tangent to this trajectory. This is illustrated in Fig. 3.1. The figure 
shows that a trajectory in configuration space can cross itself: The system could 
return later to a position q a with a different velocity q a . 

The Hamiltonian formulation of the laws of mechanics gives us an alternative 
way of representing the motion. Because the coordinates q a and the momenta p a 
are placed on an equal footing, it is natural to form a 2n-dimensional space that 
will be spanned by the n coordinates and the n momenta. This new space is called 
the phase space of the mechanical system. While the phase space is twice as large 
as the configuration space, it allows a much simpler representation of the motion. 
The reason is that a point (q a ,Pa) in phase space represents the complete state of 
motion of a mechanical system at a given time; by identifying the point we obtain the 
complete information about the positions and momenta of all the particles within 
the system. (By contrast, in configuration space the complete state of motion would 
be represented by a point and the tangent to a trajectory that passes through this 
point.) As the coordinates and momenta change with time the mechanical system 



dH 



dH 

Ik' 



(3.1.6) 
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P 




Figure 3.2: A trajectory in a two-dimensional phase space. So long as the Hamiltonian 
does not depend explicitly on time, it is impossible for the trajectory to intersect itself. 

traces a trajectory in phase space; each point on this curve represents a new time 
and a new state of motion. The tangent to a phase-space trajectory gives the 
phase-space velocity field, (q a ,Pa)- This is illustrated in Fig. 3.2. 

So long as the Hamiltonian does not depend explicitly on time, a trajectory in 
phase space can never intersect itself; as we have seen, this is quite unlike a tra- 
jectory in configuration space. This property of phase-space trajectories is another 
reason why motion in phase space is simpler than motion in configuration space. It 
follows from the fact that the Hamiltonian H(q a ,p a ) is a single-valued function of 
its arguments: there is only one value of H at each point in phase space. To see the 
connection, observe that if H is single- valued, then its partial derivatives dH/dq a 
and dH/dpa will be single- valued also; and by Hamilton's equations this implies 
that the tangent (q a ,Pa) to a phase-space trajectory is a single- valued vector field 
over phase space. If the tangent of a trajectory is unique at every phase-space point, 
the trajectory can never intersect itself. 



Exercise 3.1. Explain why this conclusion does not apply when the Hamiltonian 
depends explicitly on time. 



When the total energy of the mechanical system is conserved we find that the 
coordinates and momenta are constrained by the energy equation H(q a ,p a ) = E = 
constant. In this case the motion will proceed on a fixed "surface" in phase space; 
this "surface", which is called an energy surface, has an intrinsic dimensionality 
of 2n — 1. The existence of other constants of the motion would also restrict the 
motion to a "surface" of lower dimensionality. We will encounter specific examples 
of such "surfaces" in the next section. 

3.1.4 Hamilton's equations from Hamilton's principle 

Hamilton's equations can be derived directly from Hamilton's principle of least 
action, SS = 0. For this purpose the action functional must be expressed in terms 
of the Hamiltonian instead of the Lagrangian. Because H — ^2 a p a qa — L, we write 
it as 




'a 



H dt, 
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or 



■S'= J [Y^Padqa-Hdt). 



(3.1.7) 



The action S[q a ,Pa] must now be thought of as a functional of the variables q a (t) 
and p a (t); all these variables are considered to be independent of each other — they 
become connected only after the action has been extremized and the dynamical 
equations have been imposed. What we have, therefore, is a multi-path functional 
that depends on n paths q a (t) and n additional paths p a (t). Alternatively, we may 
think of S[q a ,p a ] as a functional of a single path in a 2n-dimensional phase space. 

We intend to compute how the action of Eq. (3.1.7) changes when the paths 
are displaced relative to some reference paths q a (t) and p a (t). We will derive the 
equations of motion by demanding that 6S — to first order in the displacements 
5q a (t) and 5p a (t). We will impose the boundary conditions Sq a (to) = 5q a (ti) = 0: 
As in the usual form of the variational principle, all paths must begin and end 
at the same end points in configuration space, q a (to) and q a (ti). We will not, 
however, impose any conditions on the variations Sp a (t); these remain completely 
free, including at t = to and t = t\. 

The variation of the action is given by 

SS = ^(dq a dp a +p a dSq a ) -^(^-Sq a + ^-dp a ^j dt 



to 



E 



dH-\. dH jMS 
p a ddq a + [dq a - -w—dt \ 5p a - —dtdq a 
Op a J Oq a 



To simplify this we write 

Pa dSq a = d(p a Sq a ) - Sq a dp a , 
and we integrate the first term. This gives 



or 



SS 



ss = J2paSq a + Yl 

t -'to 
ti 

E 



dH \ { dH 

dq a - g^~ dt ) S P a ~ [dPa + ~Q^~ dt 



^ _ —)s P - + —)sq 

dt dp a ) V dt dq a ) 



dt 



by virtue of the boundary conditions on 5q a (t). Because the variations Sq a and 5p a 
are arbitrary and independent of each other in the interval t n < t < t\, we conclude 
that 

pi TT pi IT 

ss = o => q*=°-, Pa = -°-- (3-1.8) 

dp a Oq a 
These are, once more, Hamilton's canonical equations. 



3.2 Applications of Hamiltonian mechanics 

3.2.1 Canonical equations in Cartesian coordinates 

The Lagrangian of a particle moving in a potential V(x, y, z) expressed in Cartesian 
coordinates is 

L= lm(x 2 +y 2 + x 2 )-V(x,y,z). (3.2.1) 



The momenta are 



dL 

Px = -WT- = mx 
ox 
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and so on, and the Hamiltonian is H = p x x+p y y+p z z — L. Expressing this entirely 
in terms of the coordinates and momenta, we obtain 

H=^ n (p 2 x +pl+pl) + V(x,y,z). (3.2.2) 

At this state the velocities x, y, and z are no longer part of our description. 
The canonical equations arc 

dH p x 
dp x to 

and so on, as well as 

dH dV 

= = ~ it 

ox ox 

and so on. Summarizing these equations in vectorial form, we have 

r=— , p=-W. (3.2.3) 

TO 

Notice that the first equation reproduces the relationship between p and r that was 
worked out previously. This first-order system of differential equations is of course 
equivalent to the second-order system mr = — VV, which is just Newton's old law; 
this is obtained by eliminating p from Eqs. (3.2.3). 

3.2.2 Canonical equations in cylindrical coordinates 

The Lagrangian of a particle moving in a potential V (p, (f>, z) expressed in cylindrical 
coordinates was worked out in Sec. 2.4.1. According to Eq. (2.4.3), it is 

L = l -m{p 2 + p 2 j> 2 + z 2 ) - V(p, 0, z). (3.2.4) 

Recall that the cylindrical coordinates are related to the Cartesian coordinates by 
x = p cos (f>, y — p sin <fi, and z — z. 



The momenta are 



dL 
dp 
dL 



P P = gr=mp, 



P4> = -zt = mp 2 (f 



dL 

Pz = ~di = mZ ' 



The Hamiltonian is H = p p p + p^ip + p z z — L. Expressing this entirely in terms of 
the coordinates and the momenta, we obtain 



1 ' - 2 



F= 2^(^ + 7 + ^J +n °' ' z) - (3 - 2 - 5) 

At this stage the velocities p, <ft, and i are no longer part of our description. 
Exercise 3.2. Go through the algebra that leads to Eq. (3.2.5). 



The first set of canonical equations 

P = 



are 




dH 




dp P 


TO ' 


dH 


P4> 


dp<t> 


rap- 


dH 


Pz 



(3.2.6) 
(3.2.7) 
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Notice that these equations reproduce the relationships between the momenta and 
velocities that were worked out previously. The second set of canonical equations 
are 

Pp 

P4> 

Pz 

If wc eliminated the momenta from this system of first-order differential equa- 
tions we would find that they are equivalent to the second-order equations listed in 
Eqs. (2.4.4)-(2.4.6). 



dH 

"dp 
dH 

~~d$ 
dH 

dz 



dV 
dp 
dV 

dV 
dz ' 



mp 3 



(3.2.9) 
(3.2.10) 
(3.2.11) 



Exercise 3.3. Verify this last statement. 



3.2.3 Canonical equations in spherical coordinates 

The Lagrangian of a particle moving in a potential V(r, 9, (f>) expressed in spherical 
coordinates was worked out in Sec. 2.4.2. According to Eq. (2.4.9), it is 

L= X -m{r 2 + r 2 9 2 +r 2 sin 2 9<jy 2 )-V(r, 0,<j>). (3.2.12) 

Recall that the spherical coordinates are related to the Cartesian coordinates by 
x = r sin 9 cos <f>, y = r sin 9 sin <j), and z = r cos 9. 
The momenta are 

dL 

Pr = = mr, 
dr 

po - = mr 9, 

— — - = mr sin </>. 

d(j> 

The Hamiltonian is H = p r r + pg9 + p^tj) — L. Expressing this entirely in terms of 
the coordinates and the momenta, we obtain 

(2 2 \ 

pl + ^ + ^% a ^+V{r,9 1 <t>). (3.2.13) 
r z r 1 sin 9 J 

At this stage the velocities f, 9, and <f) are no longer part of our description. 



Exercise 3.4. Go through the algebra that leads to Eq. (3.2.13). 



The first set of canonical equations are 



r = 



dH 

dp r 


_ Pr 

m ' 


(3.2.14) 


dH 

dps 


Pe 

mr z. 


(3.2.15) 


dH 

dp4> 


mr 2 sin 2 9 


(3.2.16) 
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Notice that these equations reproduce the relationships between the momenta and 
velocities that were worked out previously. The second set of canonical equations 
are 



OH = dV | p 2 | P% 

dr dr mr 3 mr 3 sin 2 9 ' 



(3.2.17) 



dH 8V Pico*® ,„„,„x 
dO ad mr 2 sir 9 

dH dV ,„„,^ 

P * - -W = "^- (3 ' 2 - 19) 

If we eliminated the momenta from this system of first-order differential equa- 
tions we would find that they are equivalent to the second-order equations listed in 
Eqs. (2.4.10)-(2.4.12). 

Exercise 3.5. Verify this last statement. 



3.2.4 Planar pendulum 

For our first real application of the Hamiltonian framework we reintroduce the 
planar pendulum of Sec. 1.3.7. The Lagrangian of this mechanical system was first 
written down in Sec. 2.1; it is 

L = m£ 2 (^6 2 +w 2 cos6^. (3.2.20) 

Here m is the mass of the pendulum, i is the length of its rigid rod, 9 is the swing 
angle, and lo 2 = g/£, where g is the acceleration of gravity. This mechanical system 
has a single degree of freedom that is represented by the generalized coordinate 9. 
The generalized momentum associated with 9 is 

dL 

Pe = — - = m£ 2 0, 
89 

and this equation can be inverted to give 9 is terms of pg. The pendulum's Hamil- 
tonian is H = pg9 — L, or 

H = -p— -m£ 2 u? cos 9. (3.2.21) 

As usual, we find that at this stage the generalized velocity 9 is no longer part of 
our description. 

Exercise 3.6. Verify Eq. (3.2.21). 

The canonical equations for the Hamiltonian of Eq. (3.2.21) arc 

6 - I - 

dH 

p 8 = — — - = ~ m e 2 uj 2 sin6. (3.2.23) 
o9 

If we eliminate pg from this system of equations we eventually obtain the second- 
order equation 9 + lo 2 sin 6* = 0, which is the same as Eq. (1.3.24). 
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Figure 3.3: Phase trajectories of a planar pendulum. The innermost curves have a 
rescaled energy e smaller than lo 2 ; they represent the bounded oscillations of a pendulum 
between the angles ±#o, where e = — lo 2 cos8q. The outermost curves have a rescaled 
energy larger than uj 2 ; they represent a pendulum undergoing complete revolutions instead 
of bounded oscillations. The thin black curve represents the marginal case e — lo 2 . 

The canonical equations must be integrated numerically if we wish to determine 
the functions 9(t) and pg(t). Because they are already presented as a system of first- 
order differential equations for the set (9,pe) of dynamical variables, the numerical 
techniques introduced in Sec. 1.6 can be applied directly. This is one advantage of 
the Hamiltonian formulation of the laws of mechanics: the first-order form of the 
equations of motion means that they are directly amenable to numerical integration. 

Because the pendulum's Hamiltonian does not depend explicitly on time, it is 
a constant of the motion. The dynamical variables of the mechanical system are 
therefore constrained by the equation 



where E is the pendulum's total mechanical energy. This equation describes a one- 
dimensional curve in the two-dimensional phase space of the mechanical system. 
This curve is the trajectory of the pendulum in phase space. A number of such 
phase trajectories are shown in Fig. 3.3. 

To describe what is going on in Fig. 3.3 it is helpful to introduce the rescaled 
momentum p = pg/(m£ 2 ) and the rescaled energy e = E/(m£ 2 ). In terms of these 
variables Eq. (3.2.24) becomes 



and the phase trajectories are obtained by solving this for p(0). There are two 
solutions, one for which p is positive, and the other for which p is negative. 

When e < lo 2 we find that the momentum vanishes whenever 8 = ±#oj the 
amplitude 9q of the motion is determined by e = — lo 2 cos 9 . The motion is then 




— m£ 2 ui 2 cos 9 = E = constant, 



(3.2.24) 



p 2 — uj 2 cos 9 = e, 
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limited to the interval — 9q < 9 < 9q, and we have the usual situation of a pendulum 
oscillating back and forth between the limits ±9q- The phase trajectories represent- 
ing this bounded, oscillatory motion arc closed curves that pass through p = 
whenever 9 achieves its limiting values ±#o- The fact that these phase trajectories 
are closed reflects the fact that the motion of the pendulum is periodic. 

When s > lo 2 we find that turning points can no longer occur: p never changes 
sign, and 9{t) increases (or decreases) monotonically. In this case the pendulum 
does not oscillate; instead it undergoes complete revolutions. The phase trajectories 
representing this unbounded motion arc open curves in the two-dimensional phase 
space. 

The phase trajectory that represents the motion of a pendulum with e = lo 2 
separates the closed curves that represent oscillatory motion and the open curves 
that represent the complete revolutions. This curve is called a separatrix. 

3.2.5 Spherical pendulum 

We next turn to the spherical pendulum, a mechanical system with two degrees of 
freedom. Its Lagrangian was derived in Sec. 2.2.4; according to Eq. (2.4.19), it is 



Here m is the mass of the pendulum, £ is the length of its rigid rod, 9 and <f> give the 
angular position of the pendulum (the angles are defined in Fig. 2.10), and uj 2 = g/£. 
The factor rat 2 in L multiplies each term, and its purpose is simply to give an 
overall scale to the Lagrangian; the factor accomplishes nothing else, and it would 
just come along for the ride in our further developments. To save ourselves some 
trouble we will eliminate this factor by rescaling our quantities. Thus we will deal 
with the rescaled Lagrangian L = L/(m£ 2 ), the rescaled momenta pg = pg/(m£ 2 ) 
and p^ = p r j,/(m£ 2 ), and the rescaled Hamiltonian H = H/(m£ 2 ). 
The (rescaled) Lagrangian is 




L=-9 2 + - sin 2 
2 2 



9 



<ft 2 + lo 2 cos#, 



(3.2.25) 



and the (rescaled) momenta are 



dl 



9 



Pe 



89 
dl 



sin 2 (9 0. 



The (rescaled) Hamiltonian is H 



p e 9 + p 



L, and this becomes 




(3.2.26) 



after expressing the velocities in terms of the momenta. 



Exercise 3.7. Verify Eq. (3.2.26). 



The canonical equations are 







dH 

dpe 



= Pe, 



(3.2.27) 
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Figure 3.4: Phase trajectories of a spherical pendulum. The different curves have 
different values of e but they share the same value of h. The fact that all the trajectories 
are closed indicates that the motion is always bounded and periodic. 



Po 



<9H 



sin 2 9 ' 



dH _ P^cos6 
dO ~ sin 3 6 



lu 2 sin ( 



(3.2.28) 

(3.2.29) 
(3.2.30) 



The last equation implies that is a constant of the motion; we shall set = h = 
constant, as we have done in Eq. (2.4.21). Equations (3.2.27) and (3.2.29) must be 
integrated numerically to determine the functions 9(t) and po(t); when these are 
known Eq. (3.2.28) can be integrated for <fi(t). 

Exercise 3.8. Show that Eqs. (3.2.27) and (3.2.29) are equivalent to the second-order 
differential equation of Eq. (2.4.20). 



The motion of the spherical pendulum in phase space can be described analyti- 
cally. Because = h and H = e are constants of the motion, the phase trajectories 
are described by 

1 h 2 

-uj 2 cos9 = e. (3.2.31) 



2 sin^ 9 



This equation can be solved for pg, and the resulting curves are displayed in Fig. 3.4. 
Here the motion always takes place within the bounded interval 6- < 9 < 6 + , where 
the limits 9± are determined by the values of h and e (the details are provided in 
Sec. 2.4.4). It should be noted that the phase space of the spherical pendulum is, 
strictly speaking, four-dimensional, because it is spanned by the coordinates 9, pg, 
4>, and p<f,. We have reduced this to an effective two-dimensional phase space by 
examining a "surface" p,p = constant = h, and by discarding the <p direction. 
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3.2.6 Rotating pendulum 

The rotating pendulum was first examined in Sec. 2.4.5. Here we have a planar 
pendulum whose pivot point is attached to a centrifuge: it rotates with an angular 
velocity fl on a circle of radius a. The Lagrangian of this mechanical system is 
displayed in Eq. (2.4.25): 



1 



- in 



(a^) 2 + 2a£fl6sm.(6 - fit) + £ 2 9 2 - m£uj' 2 (asmflt - £cos9) 



Before we proceed we simplify this Lagrangian using the rules derived in Sec. 2.5.3: 
We discard the term ^m(afl) 2 because it is merely a constant, and we discard the 
term — m£uj 2 a sin fit because it is the time derivative of (m£uj 2 a/fl) cos fit. The 
simplified Lagrangian is 



L = ml 2 



We simplify this further by rescaling away the common factor of m£ 2 . Our final 
(rescaled) Lagrangian is therefore 



(3.2.32) 



It is noteworthy that this Lagrangian depends explicitly on time; this comes as 
consequence of the fact that the pendulum is driven at a frequency fl. 
The (rescaled) momentum associated with 9 is 



3L 

89 



afl 



sin{9-flt). 



Notice that this is not simply equal to 9; here the momentum differs in an essential 
way from the generalized velocity. The (rescaled) Hamiltonian is H = p9 — L. After 
expressing 9 in terms of p, this becomes 



afl 



sin(6» - fit) 



j 2 cos 9. 



(3.2.33) 



Exercise 3.9. Verify Eq. (3.2.33). 



The canonical equations are 
<9H afl 



9 



dp ' 

<9H 



sin(0 - fit), 



de =-co S (9-flt) 



afl 



sin(0 - fit) 



uj 2 sin( 



(3.2.34) 
(3.2.35) 



These equations must be integrated numerically, and the results can be displayed as 
curves in the two-dimensional phase space spanned by the generalized coordinate 9 
and its (rescaled) momentum p. This is done in Fig. 3.5 for selected values of fl/uu. 



Exercise 3.10. Show that Eqs. (3.2.34) and (3.2.35) are equivalent to Eq. (2.4.26). 



124 



Hamiltonian mechanics 



0.5 - 



a, o - 



-0.5 - 



-1 




-30 -20 -10 10 20 30 

e 




-2.5 * ' ' ' ' ' ' ' ' 1 

-200 200 400 600 800 1000 1200 1400 1600 

e 

Figure 3.5: Phase trajectories of a rotating pendulum. The upper graph shows the 
motion in phase space of a pendulum driven at a frequency = OAlo; the motion in 
configuration space can be seen in the upper graph of Fig. 2.14. The lower graph has 
fl = 0.9a; instead, and the motion in configuration space can be seen in Fig. 2.15. In both 
cases we set (a/£)Q, 2 = 0.2 and use the same initial conditions as in Figs. 2.14 and 2.15. 
The motion in the upper graph is always confined to the interval —30° < 9 < 30°. The 
motion in the lower graph is not bounded: After oscillating a few times the pendulum 
is driven to go through a number of complete revolutions before going back to a brief 
oscillation cycle. In both cases the phase trajectories intersect themselves; this is possible 
because the Hamiltonian depends explicitly on time. 
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3.2.7 Rolling disk 

The rolling disk was first examined in Sec. 2.4.6. Its Lagrangian was obtained in 
Eq. (2.4.30) and it is 

3 

L = -mR 2 9 2 - mg(£ - R6) sin a. 

Here m is the mass of the disk, R its radius, I is the total length of the inclined 
plane, and a is the inclination angle; the disk's motion is represented by the angle 0, 
and these quantities are all illustrated in Fig. 2.16. We can simplify the Lagrangian 
by discarding the constant term — mg£ sin a; we obtain 

L = -mR 2 9 2 + mgR sin a 9. (3.2.36) 

The momentum associated with is p = pg = dL/d9 = ^mR 2 9, and the Hamilto- 
nian is H = p9 — L, or 

2 

H = — t-— - mgR sin a 9 (3.2.37) 
smR* 

after expressing 9 in terms of p. 



Exercise 3.11. Verify Eq. (3.2.37). 



The canonical equations are 

dH 

p = — = mgR sin a. (3.2.39) 

By eliminating p from this system of equations we obtain 9 = |g sin a/ R, which 
is the same as Eq. (2.4.31). A particular solution to this second-order differential 
equation was displayed in Eq. (2.4.32). From this solution it is easy to calculate 

Pit)- 



Exercise 3.12. Obtain the solution to the canonical equations which enforces the 
initial conditions 6(t = 0) = and p(t = 0) = po, where po is an arbitrary constant. Plot 
the motion of the disk in phase space for selected values of po, and verify that your plots 
look similar to those featured in Fig. 3.6. Finally, show that the phase trajectories are 
described by the equation 

p 2 - 3m 2 gR 3 sin a 9 = 3mR 2 E, 
where E is the disk's total mechanical energy; find the relationship between E and po- 



3.2.8 Kepler's problem 

Kepler's problem was first considered in Sec. 1.5. It was revisited in Sec. 2.4.7, where 
the Lagrangian of two bodies subjected to their mutual gravity was decomposed into 
a centre-of-mass Lagrangian that governs the overall motion of the centre of mass, 
and a relative Lagrangian that governs the relative separation r between the two 
bodies. The relative Lagrangian was expressed in polar coordinates in Eq. (2.4.36), 
which we copy here: 

L=l^r 2 +r 2 ^ 2 ) + ^. (3.2.40) 
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Figure 3.6: Phase trajectories of a rolling disk. The curves with p positive represent a 
disk rolling down the inclined plane. The curves with p negative represent a disk rolling 
up. 



The quantity fi = m\rnil(jn\ + rn 2 ) is the reduced mass of the gravitating system, 
and M = mi + is the total mass. The distance between the two bodies is r, 
and 4> is the orbital angle. Our effective one-body system possesses two degrees of 
freedom. 

The momenta associated with r and d> are 



Pr 



dr 
dL 



fir, 



fir 2 (j>, 



and the Hamiltonian is H 



p r r + p^cj) - L, or 

,,2 



H 



Pv 



Pi 



GfiM 



2/x 2fir 2 



after eliminating the velocities in favour of the momenta. 



Exercise 3.13. Verify Eq. (3.2.41). 



(3.2.41) 



The canonical equations are 



dH p r 

dp r [I ' 

dH _ p^_ 

dpcf, fir 2 ' 
dH _ p\ GfiM 
dr fir 3 r 2 



(3.2.42) 
(3.2.43) 

(3.2.44) 
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Figure 3.7: Phase trajectories of Kepler's problem, in a two-dimensional subspace of 
the complete four-dimensional phase space. The different curves have different values of e 
but they share the same value of h. The fact that all the trajectories are closed indicates 
that the motion is bounded and periodic. The curves are most easily produced by using 
the results of Sec. 1.5.9: We use the parametric representation r = p/(l + ecos0) and 
p — r = e*J GM/psin (f>, where p is the semilatus rectum and e the eccentricity. In terms 
of these we have h = \JGMp and e = — GM(1 — e 2 )/(2p). The phase trajectories displayed 
here have eccentricities of 0.3, 0.4, and 0.5, respectively; they all share the same value of 
V- 




(3.2.45) 



The last equation implies that p<p is a constant of the motion; we shall express this 
as p^/fi = h = constant, as we have done in Eq. (2.4.38). Equations (3.2.42) and 
(3.2.44) can be shown to be equivalent to Eq. (2.4.37). 



Exercise 3.14. Verify this last statement. 



The solutions to the equations of motion were studied back in Sec. 1.5. The 
motion in phase space is described by the equation H = E = constant, which 
expresses the fact that H also is a constant of the motion. Introducing the rescaled 
quantities p = p r j \x and s — E//i, this equation states that 

1 2 h 2 GM 

2 P + 2^-~ = £ - 

This equation can be solved for p and the result is displayed in Fig. 3.7. 

3.2.9 Charged particle in an electromagnetic field 



For our last application we consider a particle of charge q moving in an electric 
field E and a magnetic field B. The fields can be expressed as E = —dA/dt — V<& 
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and B = V x A in terms of potentials $ and A; and as we saw in Sec. 2.6, the 
Lagrangian of the particle is 

L=^mv 2 -q^ + qA-v, (3.2.46) 

where v 2 = v-v = x 2 +y 2 + z 2 . The momentum vector p associated with the position 
vector r was obtained back in Eq. (2.6.5); it is p = mv + qA. The Hamiltonian is 
H = p ■ v — L, or 

H = -L (p - qA) ■ (p - qA) + q$. (3.2.47) 



Exercise 3.15. Verify Eq. (3.2.47). 



The canonical equations governing the evolution of x and p x are 

3H 1 

x = ^ = -(p x ~qA x ), (3.2.48) 
op x m 

dH q, OA o>$ , nn , n , 

Px - - ir = ±(p-qA)- — -q—, 3.2.49 
ox m ox Ox 

and similar equations can be obtained for the pairs (y,p v ) and (z,p z ). The second- 
order differential equation that is obtained by eliminating p x from the system of 
Eqs. (3.2.48) and (3.2.49) is 

m'x = q(E + vx B) x , 

and this is the x component of the Lorentz-force law. The other components are 
obtained by similar manipulations. 



Exercise 3.16. Go through the algebra that leads to Eqs. (3.2.48) and (3.2.49), and 
then repeat these calculations for the other four canonical equations. Finally, show that 
these equations are indeed equivalent to the Lorentz-force law, mx = q(E + v x B). Be 
warned: this last calculation can be a bit tricky! 



3.3 Liouville's theorem 

3. 3. 1 Formulation of the theorem 

We wish to examine the motion of a large number N of identical particles in phase 
space; each particle has its own position and momentum, but all are subjected to 
the same potential V. We may imagine that the N particles co-exist peacefully, 
without interacting with one another. Or we may imagine that the particles are 
in fact mental copies of one and the same particle, on which we are carrying out TV 
separate experiments. In all cases we shall refer to the TV particles as an ensemble 
of particles, and we wish to follow the motion of this ensemble in phase space. 

Supposing (for concreteness) that each particle possesses three degrees of free- 
dom, which it would if it were to move in a three-dimensional space, we could form 
a 6./V-dimensional phase space of all positions and momenta of all the particles, and 
we could display the motion of the whole ensemble as a trajectory in this super 
phase space. We shall not follow this strategy, although it is a viable one. Instead, 
we will simultaneously represent the motion of all N particles in the six-dimensional 
phase space of an individual particle (which one we pick does not matter, because 
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Figure 3.8: The initial state of motion of an ensemble of N identical particles is rep- 
resented by N representative points in the phase space of an individual particle. These 
points are distributed within a bounded region 1Z of the phase space. 

the particles are all identical); this individual phase space is spanned by the three 
position variables and the three momentum variables. We will have, therefore, a 
collection of N separate trajectories in phase space. 

We have seen that a point in phase space — the phase space of an individual 
particle — gives a complete representation of the state of motion of that particle at 
an instant of time. By identifying the point we automatically know the particle's 
position and momentum, and this is all there is to know about the state of motion 
of the particle at that instant of time; the equations of motion then tell us how 
the state of motion will change from this time to the next time. As was mentioned 
previously, the particle will trace a trajectory in phase space, and each point on this 
curve will represent a state of motion corresponding to a given instant of time. 

Suppose that we wish to represent, at an initial moment t — 0, the state of 
motion of our ensemble of N particles, and that we wish to do so in the phase space 
of an individual particle. We will need to identify N points in the phase space, 
and each of these points will represent the state of motion of one of the particles. 
We will call them representative points. Because we give each particle its own set 
of initial conditions, the representative points will be spread out in phase space, 
and they will define a region 1Z of phase space. We will assume that this region is 
bounded; this is illustrated in Fig. 3.8. 

Each particle within the ensemble moves according to Hamilton's equations, 
and each representative point traces a trajectory in phase space. Because the initial 
conditions are different for each particle, each trajectory is different. In a time t 
the initial region 11(0) of phase space will be mapped to a distinct region H(t); this 
mapping is illustrated in Fig. 3.9. The shape of TZ(t) will in general be very different 
from the shape of the initial region 1Z(0). But according to Liouville's theorem: 

The "volume" of the region lZ(t) of phase space, 




is independent of the time t] the volume does not change as the region 
lZ(t) evolves in accordance with the motion of each representative 
point in phase space. 
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Figure 3.9: An initial region 1Z(0) of phase space containing the N representative points 
is mapped by Hamilton's equations to a new region TZ(t). The shape of the new region can 
be very different from the shape of the initial region. But according to Liouville's theorem, 
their phase-space "volumes" are equal. 

So Liouville's theorem states that while the Hamiltonian evolution of the ensemble 
will produce a deformation of the region TZ(t), the evolution will nevertheless pre- 
serve its "volume" , as defined by the integration over TZ(t) of the "volume element" 
dV = dqidq2 ■ ■ ■ dpidp2 • • • in phase space. 

The proof of this important theorem will be presented below. Illustrations of 
the theorem are displayed in Fig. 3.10, which shows how an initial region 72(0) 
of a two-dimensional phase space evolves over time. It is important to note that 
Liouville's theorem is formulated in phase space, and that its validity is therefore 
restricted to phase space. An attempt to formulate such a theorem in configuration 
space, or in a position-velocity space, would fail: Volumes in such spaces are not 
preserved under the time evolution of the ensemble. 

3.3.2 Case study: Linear pendulum 

The examples displayed in Fig. 3.10 involved a nonlinear pendulum, and the nonlin- 
earities of the dynamics produced interesting distortions of the initial (rectangular) 
region 72.(0) of the system's phase space. These distortions, however, are difficult 
(probably impossible) to describe mathematically, and this means that the validity 
of Liouville's theorem would be difficult to check directly. 

To help build confidence in these new ideas we will simplify the problem further 
and eliminate the nonlinear aspects of the dynamics. We will therefore examine the 
motion of an ensemble of linear pendula. Each pendulum possesses the Hamiltonian 

H= 1 -p 2 + 1 -^0\ (3.3.1) 

which can be obtained from Eq. (3.2.21) by (i) invoking the approximation cos# = 
1 — \9 2 , (ii) discarding an irrelevant constant term, and (iii) rescaling the variables 
according to H/ (mi 2 ) — * H and po / (m£ 2 ) — > pg = p. 
The canonical equations are 

9=p, p=-uj 2 9, (3.3.2) 

and they are equivalent to the second-order differential equation 9 + uj 2 9 = that 
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Figure 3.10: Evolution of a region lZ(t) in a two-dimensional phase space; the mechan- 
ical system is the planar pendulum of Sec. 3.2.4. For all three plots the initial region 1Z(0) 
is the rectangular region near the top of the page, and TZ(t) is drawn at three successive 
times. Each region consists of 400 representative points, drawn as open circles, and the 
motion of each point in phase space is determined by numerical integration of Hamilton's 
equations. Two bounding phase trajectories are also shown to guide the eye. In the first 
(upper) graph the initial conditions are such that the motion each pendulum is bounded, 
limited to an interval —6*0 < < 8o- In the second (middle) graph the initial conditions 
are such that the motion of each pendulum is not bounded; each goes through complete 
revolutions. In the third (lower) graph the initial conditions are such that about half 
the pendula undergo bounded motion, while the other half undergo unbounded motion. 
All three graphs reveal a significant distortion of the region lZ(t) as the motion of each 
pendulum proceeds; but the area of this region is the same at all times. 
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governs simple harmonic motion. The general solution to Eqs. (3.3.2) is 

9(t) = 9(0) cos cut + sin ut, (3.3.3) 
p(t) = p(Q)coswt-w6(0)smu}t, (3.3.4) 

where 9(0) = 9(t — 0) is the initial position of a pendulum sampled from the 
ensemble, and p(0) = p(t = 0) is its initial momentum. The energy of each pendu- 
lum is conserved, and the trajectory of each pendulum in phase space is an ellipse 
described by 

\p 2 + l r W = e=\p\0)+ l r W(0). 

By varying the initial conditions among all the pendula within the ensemble, we 
trace ellipses of varying shapes and sizes. 

Each pendulum within the ensemble has its own set of initial conditions 9(0) 
and p(0). The spread of initial conditions in phase space defines the initial region 
1Z(0). Suppose that we choose initial conditions such that the N values for 9(0) are 
centered around 9(0) —0 and are within a deviation ag away from zero, either in the 
negative or positive direction. Suppose also that the ./V values for p(0) are centered 
around p(0) = po and are within a deviation a p away from p . What we have, 
then, is a region 1Z(0) in phase space that is centered at (9,p) = (0,Po) and has a 
typical extension of ag in the position direction, and a typical extension of a p is the 
momentum direction. This region will evolve to TZ(t) in a time t, as each pendulum 
within the ensemble moves in phase space. We wish to describe this evolution, and 
in particular, we wish to show that the "volume" of TZ(t) is independent of time. 

Concretely we choose the boundary of 7^(0) to be described by an ellipse of 
semiaxes ag and a p , centered at 9(0) = and p(0) = po- (It is important to 
understand that this ellipse has nothing to do with the elliptical motion of each 
pendulum in phase space. We have two unrelated ellipses: one representing the 
motion of each pendulum in phase space, the other representing the distribution of 
initial conditions.) We describe this boundary by the parametric equations 

9(0; a) = — a & cos a, p(0; a) = p a + a, p sin a, (3.3.5) 

in which the parameter a ranges from to 2ir. All the representative points arc 
initially located within this ellipse, and the region 71(0) is therefore a solid ellipse; 
this is illustrated in Fig. 3.11. The phase-space "volume" of this region is, in this 
two-dimensional context, the surface area of the solid ellipse. This "volume" can 
be calculated as 

V(0) = [ d9(0)dp(0) 

P+ (0)d9(0)+ / p_(0)d6(0). 

The first integral is the area under the upper branch of the ellipse (the one for which 
P > Po), and the second integral is (minus) the area under the lower branch (the 
one for which p < p ). This can be expressed cleanly as 

= / [po + cr p sin a] [ag sin a] da, 
Jo 



and integration gives 



V(0) = iraga. 



(3.3.6) 
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Figure 3.11: The initial region 1Z(0) in phase space has an elliptical boundary. The 
ellipse is centered at (8,p) = (0,po) and it has semiaxes ag and a p . An angle a (not shown) 
parameterizes the position on the boundary. 

the expected result for an ellipse of semiaxes ag and <jq. This is the phase-space 
"volume" of our initial region 7?.(0). We now wish to determine how this region 
evolves in time, and how its volume changes. 



Exercise 3.17. Verify Eq. (3.3.6). 



As time moves forward each point 9(0; a), p(0; a) on the boundary of the region 
1Z(0) is mapped to a corresponding point 9(t; a), p(t; a) on the boundary of the new 
region TZ(t). The coordinates of the new point are given by Eqs. (3.3.3) and (3.3.4), 
which we write as 

v(0- a) 

0(t;a) = 6(0; a) cos cot + Fy ' ; smut, (3.3.7) 

u> 

pit; a) — p(0; a) cosojt — uj9(0; a) smut. (3.3.8) 

The new regions TZ(t) are displayed in Fig. 3.12 for selected values of t. Their 
"volume" is given by 

V[t) = { d6{t)dp{t) 
Jn(t) 

= I p { t]a)^±da. 

After involving Eqs. (3.3.7), (3.3.8) and performing the integration, we arrive at 

V{t) = Trowp, (3.3.9) 

the same result as in Eq. (3.3.6). The volume of the phase-space region TZ(t) is 
indeed independent of time. 



Exercise 3.18. Go through the calculational steps that lead to Eq. (3.3.9). 
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Figure 3.12: The initial region 1Z(0) in phase space is mapped to a new region lZ(t) 
after a time t. These regions are shown for cut — (uppermost ellipse), ut = n/4, cut — n/2 
(rightmost ellipse), ut = 37r/4, and so on. 



3. 3. 3 Proof of Liouville 's theorem 

There are, in fact, two versions of Liouvillc's theorem. The first version is concerned 
with a quantity p, the density of representative points in phase space, which we shall 
introduce below; it states that under a Hamiltonian evolution, 

g = 0, (3.3.10) 

so that p is a constant of the motion. The second version is concerned with the 
volume V(t) of a region TZ(t) of phase space that is defined by an ensemble of 
representative points; it states that V(t) is constant under a Hamiltonian evolution. 
The second version of the theorem is a corollary of the first. We will prove the first 
version first, and then obtain the second version. 

We have a region TZ(t) of phase space that contains a large number N of represen- 
tative points; this region has a volume V = /^/^ dV, where dV = dq\dq 2 ■ ■ ■ dp\dp% ■ ■ 
is the element of phase-space volume. We imagine that N is sufficiently large that 
we can introduce a notion of phase-space density p of representative points; this, 
by definition, is the number dN of representative points contained within a small 
region of phase space, divided by its volume dV . We have, therefore, p — dN/dV, 
and the density can vary from point to point in phase space: p = p(q ai p a ,t); we 
also allow the density to depend explicitly on time. 

The phase-space density p plays essentially the same role here as the density 
of electric charge p e plays in electromagnetism. If we introduce a velocity field 
v = (<ji, fa, ■ ■ ■ jjHjP2j • • •) in phase space, then the current density j = pv will play 
essentially the same role here as the electric current density j e plays in electromag- 
netism. It is known that in electromagnetism, the charge and current densities are 
related by an equation of continuity, 
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We will show that a very similar equation of continuity applies to p and j in phase 
space. In clcctromagnctism the equation of continuity is a differential statement of 
charge conservation; in phase space it will be a differential statement of the fact 
that the number of representative points is conserved. 



Exercise 3.19. Consult your favorite textbook on electromagnetism and review its 
derivation of the equation of continuity for electric charge. 



Consider a region V of phase space which is bounded by the "surface" S. This 
region is completely arbitrary but it is assumed to be fixed in time; unlike the 
region 1Z(t) considered previously, this one does not move around in phase space. 
The representative points contained in lZ(t) do move, however, and in time they will 
move in and out of the region V. The number of representative points contained in 
V at any given time t is given by the phase-space integral J v pdV. The number of 
representative points that move out of V per unit time is then given by 



If the total number of representative points is to be conserved, this number must 
be equal to the number of representative points that cross the bounding surface S, 
in the outward direction, per unit time. By definition of the current density j, this 
is 



where da is an element of "surface area" in phase space; this vector is directed 
along the outward normal to the surface, and its magnitude is equal to the area of 
an element of surface in phase space. Equating these two expressions gives 



We next use the phase-space version of Gauss's theorem to express the right-hand 
side as a volume integral, 



Jv 

where V = (d/dq\,d/dq2, • • • , d/dpi,d/dp2, ■ ■ ■) is the gradient operator in phase 
space. We now have 



and since this equation must be valid for all regions V of phase space, we conclude 
that 




j • da, 



s 








or 
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The phase-space coordinates of the representative points change in accordance 
with Hamilton's equations. We may therefore substitute q a = dH/dp a and p a — 
—dH/dq a into the preceding equation. We obtain 







dp 



+E 

a 



dt 

dp 



dp 

dt 



+E 



d 

dq a 
dp dH 
dq a dp a 
dp dH 
dq a dp a 



dH\ 
P dp-J 



d 

dp a 

dp dH 
dp a dq a 
dp dH' 
dp a dq a . 



8H\ 

P dq-) 



d 2 H 

dq a dp a 



d 2 H 

dp a dq a 



or 







dp 
dt 



+ E 



dp . dp . 

T. Qa T T. Pa 

dq a dp a 



after involving Hamilton's equations one more time. 

We have obtained the first version of Liouville's theorem: If the phase-space 
density p is a function of q a , p ai and t, then by virtue of the chain rule its total 
time derivative is 



dp 
~d~t 



= E 



dp . 
dq a 



dp . ■ 

dp a 



+ 



dp 
dt' 



(3.3.11) 



and according to our previous results, this is zero. We have therefore established 
Eq. (3.3.10) on the basis of the equation of continuity in phase space. 

To arrive at the second version of Liouville's theorem, consider the N repre- 
sentative points that are contained in the moving region TZ(t) of phase space. By 
definition of the phase-space density, we have 



N 



-J 

Jn(t) 



pdV, 



and we know that this number is preserved as we follow the evolution of TZ(t) over 
time. We now also know that the density p is a constant of the motion. This 
means that if, for example, the density is initially chosen to be uniform over 71(0), 
then it will stay uniform over lZ(t) throughout the Hamiltonian evolution. In this 
case we may bring p outside of the integral, and we obtain the statement that 
N/p = j n t t \dV = V(t) is preserved during the evolution. This is the second 
version of Liouville's theorem. 



3.3.4 Poisson brackets 



The expression of Eq. (3.3.11) for the total time derivative of the phase-space den- 
sity, 

dp 
dt 



dp 
~dt 



E 



dp . 
dq a 



dp . ' 

dp a 



dp 

dt 



+ E 



dp dH 
dq a dp a 



dp dH 
dp a dq a 



is in fact a mathematical identity that holds for any function p(q a ,p a ,t) defined in 
phase space. Because this expression is so general, it occurs often, and it has proved 
convenient to introduce a notation to recast it in a more compact form. 

Let f(q a ,Pa,t) and g(q a ,p a ,t) be any two functions on phase space. Their 
Poisson bracket is defined by 

df_dg_ _ df dg 
s dq a dp a dp a dq a/ 



E 



(3.3.12) 
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The Poisson bracket possesses the following properties: It is antisymmetric, 

[<?, /] = -[/, g\\ (3.3.13) 
it is linear with respect to each of its arguments, 

l.fi + f2, g] = [fug] + [f2,g], [f, gi + $a] - [/, .91] + [/, g 2 \; (3.3.14) 

it satisfies the product rule of differential calculus, 

1/1/2,5] = .h[h,g] + [fug]f2, [f,gM - gi[f,g 2 ] + [/,<?i]<7 2 ; (3.3.15) 

and finally, it satisfies the Jacobi identity, 

[f,[g,h]] + [h,[f,g]] + [g,[h,f]]=0. (3.3.16) 



Exercise 3.20. Show that these are all true properties of the Poisson bracket. Be 
warned: To establish the Jacobi identity requires a lengthy calculation. 

Particular applications of the Poisson bracket are 

IA*J~& Ift-J-g- (33.17) 
Special cases of these identities are 

[la, Qb] = 0, [q a ,Pb] = Sab, [Pa,Pb] = 0. (3.3.18) 

Exercise 3.21. Verify Eqs. (3.3.17) and (3.3.18). 

In terms of the Poisson bracket, the total derivative with respect to time of a 
function f(q a ,p a ,t) is given by 

f t = % + [f,H]. (3.3.19) 

If we apply this identity to the Hamiltonian H we obtain dH/dt = dH/dt+[H, H] = 
dH/dt, by virtue of the antisymmetric property of the Poisson bracket. If the 
Hamiltonian does not depend explicitly on time, we obtain the statement dH/dt = 
and the conclusion that the Hamiltonian is a constant of the motion. This is a well- 
known result by now, but notice how quickly the result follows from the Poisson- 
bracket formalism. 

Exercise 3.22. Verify Eq. (3.3.19). Then show that it leads to the expected answers 
for dq a /dt and dp a /dt. 



3.4 Canonical transformation 

3.4-1 Introduction 

A theme that has been central to our development of Lagrangian and Hamiltonian 
mechanics is the arbitrariness of the generalized coordinates q a that are adopted to 
describe the motion of a mechanical system. The Euler-Lagrange equations 
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and the canonical equations 

. _ dH . dH 

la — 7; ; Pa 7* 

OPa Oq a 

are all invariant under a transformation of the generalized coordinates, from the set 
q a of old coordinates to any set Q a of new coordinates; these can by any functions 
Q a (qi, q2, •••) of the old coordinates. 

In this section we show that the Hamiltonian formulation of the laws of me- 
chanics admits a much wider class of possible transformations. In this context it is 
possible to change the phase-space coordinates from an old set (q a ,Pa) to a new set 
(Q a ,P a ), with 

Q a = Qa(qi,q2,---,Pi,P2,---), P a = P a (qi,q2,---,Pi,P2,---)- (3.4.1) 

Notice that the new generalized coordinates Q a are now functions of the old coordi- 
nates and the old momenta; the new generalized momenta P a also are functions of 
all the old phase-space coordinates. Under some conditions, which will be specified 
below, such a transformation will leave the canonical equations invariant: In the 
new system of phase-space coordinates there will exist a transformed Hamiltonian 
H'(Q a ,P a ,t) such that 

Under these conditions the transformation is known as a canonical transformation; 
transformations of the phase-space coordinates that are not canonical have no value, 
and they will not be considered. 

Canonical transformations have the interesting and useful property that they 
leave the element of phase-space volume invariant. Thus, 

dV = d qi dq 2 ■ ■ ■ d Pl dp 2 ■■■ = dQ x dQ 2 ■ ■ ■ dP x dP 2 ■■■■ (3.4.3) 

In other words, the Jacobian of the transformation is equal to one. This gives 
us a means of checking whether a specified transformation is canonical or not: If 
the Jacobian of the transformation is not equal to one, the transformation is not 
canonical. This property of canonical transformations is rather deep, and it implies 
that the validity of Liouville's theorem is not restricted to a particular choice of 
phase-space coordinates; the volume of a region lZ(t) of phase space is invariant 
under a canonical transformation. 

Because a canonical transformation produces new coordinates that are a mix- 
ture of old coordinates and old momenta, they can dramatically alter the physical 
meaning of the phase-space coordinates. Thus, a given Q a may not necessarily 
represent a position variable, and a given P a may not represent a momentum vari- 
able. A trivial example is the canonical transformation Q a = p a , P a = —q a , which 
clearly leaves the canonical equations invariant; here the new coordinates are the 
old momenta, the new momenta are the old coordinates, and the new phase-space 
coordinates do not retain their traditional physical meaning. Because the new "co- 
ordinates" Q a and the new "momenta" P a may not have straightforward physical 
interpretations after a canonical transformation, it is customary to refer to the new 
phase-space coordinates simply as conjugate variables. 



3.4-2 Case study: Linear pendulum 



Before we present the general theory of canonical transformations in the next sub- 
section, we shall take the time to get acquainted with some of the fundamental ideas 
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by examining a specific example. We return once more to the linear pendulum of 
Sec. 3.3.2, with its Hamiltonian 

H= l -p 2 + l -to 2 q 2 , (3.4.4) 

where we have identified the generalized coordinate q with the swing angle 8. The 
canonical equations for this mechanical system arc 

p = -—=-J> q . (3.4.5) 

We intend to show that a canonical transformation can turn this rather simple 
mechanical system into a completely trivial one. Solving for the motion of the 
trivial system will allow us to find the solution to Eqs. (3.4.5) without having to 
solve these equations directly. This, in a nutshell, is the power and purpose of 
canonical transformations. 

Let us consider the following transformation of the phase-space coordinates: 

Q = arctan(^), P= _L(p2+ w y). (3 4 6) 

The new "momentum" P is proportional to the Hamiltonian; a curve P = constant 
is therefore represented as an ellipse in the old phase space. A curve Q = constant, 
on the other hand, is represented as a straight line that passes through the origin; 
this line has a slope p/q — lo/ tanQ, and Q is an angle relative to the p axis. The 
inverse transformation is 

1 2P 

q=\ — sinQ, p = V2wPcosQ. (3.4.7) 
V oj 



It is easy to check that the transformation has a unit Jacobian. This is given by 
J = 



dq/dQ dq/dP 
dp/dQ dp/dP 

= cos 2 Q + sin 2 Q = 1 



y/2P/u cos Q sin Q/V2loP 
-V2uPsmQ yfujj2P cos Q 



and J is indeed equal to one. This gives us a successful partial check on whether 
the transformation is properly canonical. 

Exercise 3.23. Check that Eq. (3.4.6) is the inverse transformation to Eq. (3.4.7). 
Then check all the partial derivatives that have been involved in the computation of the 
Jacobian. 

The transformation of Eq. (3.4.6) will be canonical if and only if it preserves the 
form of the canonical equations. We shall now show that this is indeed the case. 
We will find that the evolution equations for Q and P are given by 

with a Hamiltonian now expressed as 

H = loP, (3.4.9) 

which follows by substituting Eq. (3.4.6) for P into Eq. (3.4.4) for H. In this 
particular instance of a canonical transformation, the new Hamiltonian H' is the 
same as the old Hamiltonian H. 
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We can verify the results of Eq. (3.4.8) by computing Q and P directly from 
their definitions in Eq. (3.4.6). We begin with the relation tanQ = uiq/p, which we 
differentiate with respect to t. We get 

(1 + tan z Q)Q = — , 

p p z 

and if we now involve Eq. (3.4.5), this becomes 

(1 + tan 2 Q)Q = uj [\ + t ^p S j = w(l + tan 2 Q). 

This gives, finally, Q = ui, as we had stated in Eq. (3.4.8). The right-hand side 
of this equation happens to be equal to dH/dP, and we have recovered one of the 
two canonical equations. The second equation follows much more easily. Because 
P = H/u) it is obvious that its equation of motion is P = 0, as was stated in 
Eq. (3.4.8). The right-hand side of this equation happens to be equal to dH/dQ, 
and we have recovered our second canonical equation. 

The main purpose of the canonical transformation of Eq. (3.4.6) is to bring the 
Hamiltonian to the simple form of Eq. (3.4.9). This Hamiltonian is proportional 
to the new momentum P, and it does not depend on the new coordinate Q. As 
a result, the equations of motion are exceptionally simple, and they can be solved 
easily: The new momentum is a constant of the motion and the new coordinate Q 
behaves in time according to Q(t) = uot + 5, where 8 is a constant of integration. 
The transformation has therefore turned the original problem into a very simple 
one. With the solution to the simple problem in hand, we may return to the 
original problem and express its solution as 

l2p 

q(t) = \ — s\n{ut + 6), p(t) = V2ujP cos(wi + S), 
V uj 

by substituting our solution for Q{t) into Eqs. (3.4.7). Our linear pendulum evi- 
dently undergoes simple harmonic motion. The frequency of the motion is u), and 
its amplitude is yj2P/bj. 

3.4-3 General theory of canonical transformations 
When is a transformation of the phase-space coordinates, 

Qa = Qa(qb,Pb,t), P a = P a (q b ,p b ,t), 

a canonical transformation? The fundamental criterion is that the transformation 
must preserve the form of Hamilton's canonical equations: The transformation must 
produce a new Hamiltonian H' such that 

• _dIP_ . _ dlP 

Qa ~dP a > dQ a - 

The question is: Under what conditions does this occur? We will provide a number 
of answers to this question, ranging from the formal to the practical. 

Let us recall from Sec. 3.1.4 that Hamilton's equations for the original set (q a ,Pa) 
of phase-space coordinates can be derived on the basis of Hamilton's principle of 
least action. The principle can be expressed in the form 

ft 



5 I' {y,Padq a -Hdt \ =0; 

Jt \ n J 
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the variations Sq a (t), Sp a (t) are all independent, and they are subjected to the 
boundary conditions Sq a (to) = Sq a (ti) = 0. If Hamilton's equations are to hold 
also for the new set (Q a , P a ) of phase-space coordinates, they must also follow from 
Hamilton's principle. We must then have, simultaneously, 



J to V „ 



dQ a -H'dt) = 



here it is the variations SQ a (t), SP a (t) that are taken to be independent and sub- 
jected to the boundary conditions 5Q a (t n ) — 8Q a {t\) — 0. The two formulations 
of Hamilton's principle will be compatible with each other if and only if the inte- 
grands J2 a Pa dq a — H dt and J2a Pa dQ a — H' dt differ by the total derivative dF\ 
of a function F 1 {q a ,Q ai t) of the old and new coordinates. For we would have, in 
this case, a difference of integrals given by 

dF l =F 1 {q a {t l ),Q a {t l ),t 1 ) -fi(<Za(to),Qa(*o),*o), 

to 

and 5 dFi — would follow immediately by virtue of the boundary conditions 
on the variations Sq a and SQ a . 

The first answer to our question is therefore this: A transformation of the 
phase-space coordinates is a canonical transformation when there exists a function 
F\{q a , Q a ,t) such that 

Y,Padq a - Hdt = J2 P « d ®« -H'dt + dFi. (3.4.10) 

a a 

The function Fi(q a , Q a ,t) is called the generating function of the canonical trans- 
formation. This is a formal answer to our question; we will provide more practical 
answers at a later stage. 

The total derivative of F\ can be expressed as 

,rn dFi , dF l dF x , 

On the other hand, Eq. (3.4.10) can be rewritten as 

dFi = Y,p- d i--Y, p - + ( H ' - H ) dt 

a a 

Because both equations must be true, we obtain the identifications 

Pa = ^, P a = -^ H ' = H + 9 4- (3-4.11) 
oq a oQ a dt 

The first two equations give us the old momenta p a and the new momenta P a in 
terms of the derivatives of the generating function. The last equation gives us the 
new Hamiltonian H'; if the generating function does not depend explicitly on time, 
the new Hamiltonian is the same as the old. 

As a trivial application of the foregoing, let us consider the generating function 
Fi = qbQb- The old momenta are p a = dFi/dq a = Q a and the new momenta are 
P a = —dF\ I dQa = —q a - This generating function therefore produces the trivial 
transformation Q a = p a , P a = —q a that was encountered previously. This is a 
canonical transformation because it is generated by the function F\. Because this 
function does not depend explicitly on time, the transformation does not change 
the Hamiltonian: H' = H . And the transformation, evidently, preserves the form 
of the canonical equations. 
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A more interesting application involves the function F\ — |a>g 2 cotanQ, which 
generates the transformation of Eqs. (3.4.6) and (3.4.7). We have 

dFi uq 

V = -w- = cotanQ = - 

oq tan Q 

and 

p — - - H 1 + 4) - i<» 2+ " 2 ' 2 ), 

as was anticipated in Eqs. (3.4.6). Because F\ does not depend explicitly on t, we 
have that H' = H = ujP, as was stated in Eq. (3.4.9). 



3.4-4 Alternative generating functions 

It is possible to introduce new generating functions that depend on an alternative 
choice of variables. Consider, for example, the new function 

F 2 = F 1 +J2QaPa- 

a 

Its total derivative is 

dF 2 = dF 1 +J2 p «dQa + J2Q adPa 

a a 
a a a a 

= Y.P« d ^ + Y.Q« dP « + { - H ' '-H)dt; 

a a 

in the second line we substituted a previous expression for dF\ , and in the last line 
we canceled out the terms ^ a P a dQ a . The fact that dF 2 involves the differentials 
dq a , dP ai and dt informs us that F 2 must be a function of q a , P a , and t. We have, 
therefore, 

F 2 = F 1+ Y,QaPa = F 2 {q a ,P ai t), (3.4.12) 

a 

and this new generating function does indeed depend on a different set of variables. 
Our previous calculation allows us to make the identifications 

dFi dFi dFi 

Pa=^, Qa = J^, H'=H+^. (3.4.13) 

dq a oP a dt 

This freedom to introduce alternative generating functions adds flexibility to the 
framework of canonical transformations. We will make use of this in the next 
section. 



Exercise 3.24. Consider the new generating function F3 = Fi — ~}2 a q a p a - On which 
variables does F3 depend? Find expressions for P a , q a , and H 1 in terms of partial deriva- 
tives of F3. 



Exercise 3.25. Consider now the new generating function F4 = Fi + 5H a QaPa — 
^aQaPa- On which variables does F4 depend? Find expressions for q a , Q a , and H' in 
terms of partial derivatives of F4 . 
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3.4-5 Direct conditions 

It is rarely convenient to test whether a transformation is canonical by attempting 
to find its generating function. More direct tests are available, fortunately, and these 
do not require knowledge of the generating function. We shall describe these tests in 
this and the following subsection. For simplicity we assume that the transformation 
does not depend explicitly on time; this means that H' = H. 

The transformation Q a = Q a (qb,Pb) implies that the time derivative of the new 
coordinates can be expressed as 

A \ 9Q a . \ ^ 8Qa . 

6 6 

dQg OH 8Qa OH 

^ dq b dp b g pb Q qh 

If the transformation is canonical, this will be equal to 8H/0P a . With H written 
as a function of the old phase-space coordinates, this is 

OH >— ^ OH dqb >r-^ OH dpb 

W a ^^WbdP a + \^dpbdPa' 

Hamilton's equations therefore imply 

• OH 

v /9Q a dp b \8H ^fOQg Oq b \OH 

^\d qb dPajdpb ^\dp b dP a )dq h 

This equation will be satisfied if and only if 



J^-Qa(qb,Pb) = W^-Pb(Qa,Pa), Qa(<?6, Pb) = ~ T^qbiQa, Pa) ■ (3.4.14) 

aq b ai^a oPb c/r a 

This first set of conditions must therefore be met if the transformation is to be a 
canonical transformation. 

The second set of conditions is obtained by starting instead with the transfor- 
mation P a = P a (qb,Pb)- This time we have 

p ^OPa.^OPa. 

b qb b Pb 

OPa OH y, OPg OH 

4^ 0q b 0p b 4^ dpb 0q b ' 

b b 

If the transformation is canonical, this will be equal to —0H/0Q a - With H written 
as a function of the old phase-space coordinates, this is 

OH >— ^ OH Oqb ^ dH Opb 

OQ~a^ ^dq~bWa + \dp~bWa 

Hamilton's equations therefore imply 

• OH 

= P a + 



0Q a 



fdPa 


dpb N 


)--y 


^(dP a 


dq b X 


\0H 


V dqb 


dQ a/ 


Idpb 4 


Adp b 


dQ a/ 


1 0q b 
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This equation will be satisfied if and only if 

d d d d 

W b Pa(qh ' Ph) = -Wa PbiQa,Pa) ' W b Pa[qb ' Pb) = Wa qbiQa ' Pa) - (3A15) 

This is the second set of conditions that must be met if the transformation is to be 
a canonical transformation. 

Equations (3.4.14) and (3.4.15) are called the direct conditions for a canonical 
transformation: all these conditions will be satisfied if the transformation Q a (q a ,Pa) 
and P a (q a ,Pa) is a canonical transformation. For a mechanical system with n de- 
grees of freedom we have a total of 4n 2 conditions. As we shall see, these are not all 
independent. In the next subsection we will identify a smaller, and more convenient, 
set of necessary and sufficient conditions. 

3.4-6 Canonical invariants 

As was stated in Sec. 3.4.1, a canonical transformation has the property of leaving 
the element of phase-space volume invariant: 

dV = d qi dq 2 ■ ■ ■ d Pl dp 2 ■■■ = dQ 1 dQ 2 ■ ■ ■ dP 1 dP 2 (3.4.16) 

A canonical transformation of the phase-space coordinates therefore has a unit 
Jacobian, J = 1. This statement can be shown to be a consequence of the direct 
conditions, Eqs. (3.4.14) and (3.4.15). 

Another consequence of the direct conditions is the fact that canonical transfor- 
mations leave all Poisson brackets invariant. Thus, if 



[f,9]q, P = Yl 



df_dg_ _ df dg 
dq a dp a dp a dq a 



is the Poisson bracket in the old phase-space coordinates, and if 

df dg df dg 



[/.5]q,p = 



dQ a dP a dP a dQ aj 

is the Poisson bracket in the new coordinates, then 

[f,9] g , P = [f,9}Q,P (3-4.17) 

if the transformation is canonical. 

It is this statement which provides us with an efficient method to test whether 
a transformation Q a = Q a (qb,pt,, t), P a = P a (qi,,pi ) , t) is canonical: By virtue of the 
automatic relations (refer back to Sec. 3.3.4) 

[Qa, Qb\Q,P — 0, [Qa, Pb]Q,P = $ab, [P a , A]q,P = 

and the invariance of the Poisson bracket, we must have that the relations 

[Qa,Qb]a,p = 0, [Qa,Pb] q ,p = 5 abl [Pa,Pb]a,p = (3.4.18) 

hold if the transformation is canonical. Similarly, a transformation q a = q a (Qb, P b ,t), 
Pa = Pa{Qb, P b , t) is canonical if the Poisson-bracket relations 

[q a ,q b }Q,P = 0, [q a ,P b ]Q,P = 5ab, [Pa,Pb]Q,P=0 (3.4.19) 



are satisfied. The conditions of Eqs. (3.4.18) or (3.4.19) can be shown to be sufficient 
and necessary. For a mechanical system with n degrees of freedom, we have a total 
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of (2n — l)n conditions to satisfy; when n = 1 there is only one relevant condition, 
[Q,P]g,P = 1 or [q,p] QtP = 1. 



Exercise 3.26. Verify that [Q, P],, p 
presented in Sec. 3.4.2. 



1 in the case of the canonical transformation 



We will not present a general proof of the statements that the phase-space 
volume element and the Poisson bracket are canonical invariants. We will, instead, 
present a proof that is restricted to a two-dimensional phase space. The restricted 
proof is easy to produce; the general proof would be much more difficult. 

We consider a canonical transformation of the form Q = Q(q,p), P = P(q,p). 
The direct conditions for this transformation are 



dQ 
dq 



dp 
dP' 



dQ 
dp 



dq 
dP 1 



dP dp dP dq 
dq dQ' dp dQ 



The volume elements are related by 



dqdp = \ J\ dQdP, dQdP = | J\- x dqdp, 

in which J is the Jacobian of the transformation, and J" 1 its inverse. The Jacobian 
is 



J = 



dq/dQ dq/dP 
dp/dQ dp/dP 



dq dp dq dp 
dQdP ~ dPdQ' 



Its inverse is 



dQ/dq dQ/dp 
dP/dq dP/dp 



dQdP _ dQdP 
dq dp dp dq 

By involving the direct conditions we may write this as 

dp dq dq dp 



J- 1 = 



dPdQ dPdQ' 



and this is equal to J. We therefore have J -1 = J, or J 2 = 1, and we conclude that 
| J | = 1. This proves that the volume element is indeed preserved under a canonical 
transformation. 

The Poisson bracket in the new phase-space coordinates is 



[/> 9\q, 



df dg 
dQ dP 



df dg 
dP dQ' 



If we consider / and g to be functions of q and p, we may use the chain rule and 
express this as 



[/. 9]q, 



df dq 
dq dQ 



df dp 
&pdQ 



dq dP dp dP J \dq dQ dp dQ , 
df dg ( dq dq dq dq \ df dg ( dq dp 



dg dq dg dp 
dq dP dp dP 
df dq ^ df dp \ f dg dq i dg dp 



dqdq\dQdP dPdQ 
df dg ( dp dq dp dq 

+ ~&p~dq'\dQd~P ~dPdQ 
dj^dg _ df_dg\ ( dq dp 
dqdp dpdq)\dQdP 



dq dp 



+ 



dqdp\dQdP dP dQ 
df dg ( dp dp dp dp 
^d^\dQd~P ~ dPdQ 
dq dp 
dPdQ 
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We have learned that \J\ — 1, and in practice we may always design the canonical 
transformation so that its Jacobian is in fact J = +1. This gives us, then, the 
statement that [/, <7]q,p = [f,g] q , P , and the Poisson bracket is indeed invariant 
under a canonical transformation. 



3.5 Hamilton-Jacobi equation 

3. 5. 1 Action as a function of the coordinates and time 
The action functional of a mechanical system is 



,S' = f Ldl. 

J t 

where L(q a , q a , t) is the system's Lagrangian. We first encountered the action in the 
context of Hamilton's principle of least action, in which one compares the value of S 
for different trial paths <7a" al (i) an d attempts to find the paths q a (t) that minimize 
this value. In the course of these investigations, back in Sees. 2.2 and 2.3, we derived 
the result 

for the variation of the action about reference paths q a {f). We obtained the Euler- 
Lagrange equations by demanding that SS = for variations Sq a (t) that respect 
the boundary conditions dq a (to) = 8q a {t\) = 0. 

We now intend to examine this result from a different perspective. Suppose that 
we compute S for actual paths q a (t) that satisfy the Euler-Lagrange equations; we 
assume that our actual paths leave the positions qa C&ln at t = to and arrive at the 
positions q^ nd at t = t\ . The result would be the number S, and this number would 
depend on the choices made for q„ eKin , q^, h, and t\. 

We now ask the question: Suppose that we next evaluate S on displaced paths 
q a (t) = q a (t) + 5q a (t) that all leave <7^ cgm at t = to but arrive at the different 
positions g° nd + 5q° nd at the time t = t\\ how will this value of S differ from SI The 
answer is this: Because the reference paths all satisfy the Euler-Lagrange equations, 
and because the variations Sq a all vanish at t = to, the change in the action has to 
be 

dL 



6S = T,»- 



t=ti 



end 



Writing dL/dq a — p a and 5q° nd — Sq a (t\), this is 

5S = J2Pa(h)Sq a (ti). (3.5.1) 

a 

This result indicates that the action S is a function of the variables q a (ti) = ql nd , 
S = S(q a (t!)), and that 

dS 

- 8qJh)' <^ 2 > 

It is understood that here, the partial derivative is evaluated while holding t\ fixed. 

Let us now consider a different variation of the action. This time we choose 
displaced paths q a (t) — q a (t) + Sq a (t) that still all leave q^ esin at t = to, but that 
now arrive at the same positions g® nd at a different time t = ti + 5ti; we wish to 
calculate by how much S differs from S under this change of paths. 
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To figure this out it is helpful to recall that the total derivative of the action 
with respect to t\ is given by 

dS . , 

drr L{h) > 

in which the Lagrangian function is evaluated at t = t\. We already know that 
the action depends on ti through its dependence on q° nd = q a (ti). We should 
also expect that the action contains an explicit dependence on t\. Its total time 
derivative must therefore be expressed as 

dS__3S_ x ^ dS . 
aW.-dT^^dq-Jh)^- 

In view of Eq. (3.5.2), this is 

dS OS ^ — ^ . \ . / \ 
+ 2^Pa{ti)q a {ti). 



dti dt\ 

u> 

From all this we obtain 

|| = L(tl) - $>«(*l)?a(*l)- 

a 

The right-hand side is (minus) the Hamiltonian function evaluated at t = t\, and 
our final result is 

Wi = -H(h). (3.5.3) 

It is understood that here, the partial derivative is evaluated while holding the 
final positions q a (ti) fixed. This gives us the answer to our question: The variation 
considered here has fixed final positions and a varying time; the change in the action 
S(q a (h),ti) is 5S = (dS/dt^Sh, or 

6S = -H(t 1 )6t 1 . (3.5.4) 

The complete variation of the action, if we allow all of q° nd and t\ to be varied, is 
given by the sum of the partial deviations computed above. The general statement 
is 

dS = J2p a (tt) dq a (h) - H(h) dh. 
a 

Because this statement is true at any time t\, we may express it as 

dS = ^p a dq a - Hdt, (3.5.5) 

a 

where the momenta and the Hamiltonian are now evaluated at the arbitrary time 
t. This relation informs us that when the action is evaluated on the actual paths 
q a (t), it can be viewed as a function of the coordinates q a {t) and of time t: 

S = S(q a ,t). (3.5.6) 

Its partial derivatives are then given by 

dq- a =Pa > -dJ = - H - (3 ' 5 ' 7) 

As a concrete illustration of these notions, let us evaluate S(q, t) in the case of 
the linear pendulum of Sec. 3.4.2. The pendulum's Lagrangian is 

L = \q 2 - (3.5.8) 
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and the Euler-Lagrange equation for q(t) is q+u) 2 q = 0. The actual path is therefore 
given by 

q(t) = qo cos ivt + — sin Lot, (3.5.9) 

where go = l(t = 0) and q n = q(t = 0) are the initial conditions. We substitute this 
inside Eq. (3.5.8) and obtain 

L = ^ (<7q — uj 2 ql ) cos 2uot — Ljq q sin 2tot 

after some simplification. Setting t a — and t\ = t, the action is S — J * L dt, and 
this evaluates to 

S = ^— (<?q — to 2 q 2 ) sinujtcosujt — q a q a sin 2 cut. 

This does not yet have the expected form S(q 7 t) with q = q{t). To put the action 
in this form we solve Eq. (3.5.9) for qo and substitute this into our expression for 
S. After some simple algebra, we obtain our final answer, 

S(q, t) = \{q 2 + q 2 ) COS ut - 2q q] , (3.5.10) 

in which q stands for q(t), the changing position of the pendulum. It is easy to 
check that dS/dq = q cos Lot — uiq sinujt = q = p and —dS/dt = |(j 2 + \u 2 q 2 = H, 
in agreement with Eqs. (3.5.7). 



Exercise 3.27. Go through all the algebra that leads to Eq. (3.5.10), starting from 
Eqs. (3.5.8) and (3.5.9). Then check that Eqs. (3.5.7) do indeed follow for this action. 



3.5.2 Hamilton- J acobi equation 

We have seen that the partial derivative with respect to time of the action is related 
to the Hamiltonian by 

TT 9S n 

The Hamiltonian is a function of the coordinates q a and the momenta p a , so that 
H = H(qi, q2, ■ ■ ■ ;pi,P2, ■ ■ ■ t). But we have also seen that the momenta are related 
to the action by p a = dS/dq a . Putting this all together, we arrive at the equation 

fff?i,52,---;| 5 ,| 5 ,---;^+l?=o. (3.5.11) 



dqi ' dq 2 ' / dt 

This is a partial differential equation for the function S(qi, qi, ■ ■ ■ ; t), and this equa- 
tion is known as the Hamilton- J acobi equation. As we shall now explain, solving the 
Hamilton-Jacobi equation for S provides a round-about way of obtaining a com- 
plete solution to the original mechanical problem, which is to calculate how the 
coordinates q a behave as a function of time. This technique is intricate, but it can 
be very powerful. 

Suppose that we can find a solution to the Hamilton-Jacobi equation, and sup- 
pose that it has the general form 

S = S(q 1 ,q 2 ,-- ■ ,q n ;ai,a 2 , ■ ■ • ,£*„;*), (3.5.12) 



where n is the number of degrees of freedom, and where the a a 's are n indepen- 
dent constants of integration. Such a solution is called a complete solution to the 
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Hamilton-Jacobi equation, because it possesses a number of integration constants 
that corresponds to the number of independent variables q a . We assume that a 
complete solution exists and can be obtained; we do not assume that this solution 
is unique — indeed it is not — nor that it is the most general solution to the 
Hamilton-Jacobi equation — which it is not. 

To establish a connection between S(q a , a a , t) and the original mechanical prob- 
lem we identify it with F2(q a , Pa,t), the generating function of a canonical transfor- 
mation. Here the new momenta P a arc identified with the constants a a , and we will 
see in a moment that the dynamics generated by the new Hamiltonian H' is indeed 
such that P a = 0. The general theory of canonical transformations developed in 
Sees. 3.4.3 and 3.4.4 implies that the old momenta p a are given by 

_ dF 2 _ 8S 

P a — ~a n ' 

dq a dq a 

and this statement is certainly compatible with our derivation of the Hamilton- 
Jacobi equation. The general theory also implies that the new coordinates Q a are 
given by 

dP a da a ' 

The evolution of the new phase-space variables is governed by the new Hamiltonian 
H', which is 

The new Hamiltonian vanishes by virtue of the Hamilton-Jacobi equation! There 
is no dynamics in the new variables, because Q a — dH'/dP a = and P a = 
—dH'/dQ a = 0. We have already anticipated the fact that the new momenta 
P a = oi a are constants of the motion; we now have learned that the new coordinates 
Qa = Pa are constants also. 

The entire content of the Hamilton-Jacobi framework boils down to this: Once a 
complete solution S(q a ,a a ,t) to the Hamilton-Jacobi equation has been identified, 
the coordinates q a (t) of the mechanical system are obtained by unwrapping the 
equations 

/3 a = ^S(q b ,a b ,t), (3.5.13) 

where the n quantities f3 a , like the n quantities a a , are constants. Solving these 
equations will return equations of the from q a = q a (a 01 /3b, t), and the coordinates 
will be seen to depend on time as well as a number 2n of constants; this is as it 
should be, because we have n variables q a and they each satisfy a second-order 
differential equation. The momenta can then be computed as 

d 

p a = —S(q b ,a b ,t), (3.5.14) 
dq a 

and these will also be of the form p a = p a (ctb, Pb, t). The motion in phase space 
is thus completely determined, and the constants a a and f3 a can be related to the 
initial conditions q a (t = 0) and p a (t = 0). 

3.5.3 Case study: Linear pendulum 

To see how this all works, let us return once more to the linear pendulum and its 
Hamiltonian 

H= l -p 2 + l -uj\ 2 . (3.5.15) 
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The Hamilton- Jacobi equation for this Hamiltonian is 

1 fdS\ 2 1 2 2 dS n ,„ „ 

Because the mechanical system has a single degree of freedom, we wish to find 
a complete solution of the form S(q,a), with a playing the role of a constant of 
integration. 

We can separate the variables by adopting 

S = W(q) -at (3.5.17) 

as a form of solution; this already involves the constant a. Substituting Eq. (3.5.17) 
into Eq. (3.5.16) produces an ordinary differential equation for W: 

I(W/)» + I w y-a = 0. 
This can easily be solved for W , 



UJ 2 



W = V2a\ll-— q\ 



and integration yields 



W = V2a I \jl - 7^q 2 dq. 



To evaluate the integral we introduce a new variable of integration $, which is 
defined by 

sin$=-^=q. (3.5.18) 
V2a 



Substituting this, along with dq = (v2a/u) cos $ d$ into the integral for W pro- 
duces 

W= — [ cos 2 $<i$. 
w J 

The integral works out to be |(<& + sin$cos<I>), and we arrive at 

a 

W = -($ + sin$cos$). (3.5.19) 

This could be expressed directly in terms of q by involving Eq. (3.5.18), but it is 
more convenient in practice to leave W(q) in this implicit form. 
Our final result for the action is 



a \/2a 
S(q,a,t) = -($ + sin$cos$) - at, q= sin$. (3.5.20) 

UJ UJ 

Let us now use this information to determine the motion of the pendulum. Accord- 
ing to Eq. (3.5.13) we must first calculate dS/da and set the result equal to a new 
constant (3. The dependence of S on a is both explicit — S is proportional to a — 
and implicit, because S depends also on 3> which itself depends on a. We therefore 
write 

dS 1 _ . ^ . a* 2 ^ . 2 ^<9$ 

— = — ($ + sin $ cos $) — i H (1 + cos $ - sm $) — 

oa uj uj Oa 

1 ■ ^^ 2" 2^5$ 

= — ($ + sm$cos$) H cos $- t, 

uj uj oa 
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and we evaluate the remaining partial derivative using Eq. (3.5.18) as a starting 
point. Here we treat q as a constant and differentiate the two sides of the equation 
with respect to a; this gives 

*<9$ ujq 1 . , 

cos<P— - = = — — — — — sin$, 

da 2V2a 3 / 2 2a 

after reinvolving Eq. (3.5.18) in the last step. We have obtained 

— = — ($ + sin $ cos $) cos $ 

oa uj uj cos <t> 

= U-t 

CO 

= P, 

or 

$ = w(i + /3). (3.5.21) 

The motion of the pendulum is finally determined by substituting Eq. (3.5.21) 
into Eq. (3.5.20); our final result is 



\/2a 

q(t,a,/3) = - — sinw(t + /3). (3.5.22) 

UJ 

This evidently describes simple harmonic motion of amplitude v / 2a/u' at a frequency 
u)\ this well-known result has been obtained in a very novel way, by solving the 
Hamilton-Jacobi equation. While the use of this fancy technique hardly seems 
justified for such a simple problem (the phrase cracking a nut with a sledgehammer 
comes to mind) , the Hamilton-Jacobi framework has been shown to be very powerful 
in other, more complicated, situations. 

We can easily relate the constants a and P to the initial conditions of the motion. 
Evaluating Eq. (3.5.22) at t = gives 

q = q(t = 0) = sm ujp, 

UJ 

while differentiating Eq. (3.5.22) with respect to time and then evaluating at t = 
gives 

qo = q(t = 0) = V2a cos ui/3. 

These relations can easily be solved for a and (J. The constant (3 has a direct 
physical meaning: it determines the initial phase of the pendulum. The constant a 
also has a clear physical meaning: Solving for a yields 

a = \ol + l^ 2 ql (3.5.23) 
and this is the pendulum's total mechanical energy. 



Exercise 3.28. Calculate the momentum p of the pendulum starting from Eq. (3.5.14); 
show that p(t, a, 0) = q(t, a, (3). 



Exercise 3.29. You may have noticed that the action of Eq. (3.5.20) is very different 
from the action of Eq. (3.5. fO), which we rewrite as 



S(q,a',t) 



2 sinuit 



(q 2 + a' 2 ) coscut — 2a q 
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with a' = qo- Despite the functional differences, these are two different representations of 
the same physical quantity, expressed in terms of two different constants, a and a' . Show 
that the action given here is also a solution to the Hamilton-Jacobi equation. Determine 
the motion of the pendulum by setting dS/da' equal to a new constant /?'; what is the 
physical meaning of /?'? 



3.6 Problems 

1. A bead of mass m slides on a frictionless wire that is shaped in the form of a 
cycloid. This is described by the parametric equations 

x = a(9 — sin 6*), y = a(l + cos 9), 

where a is a constant and the parameter 9 ranges through the interval < 
9 < 2tt. The bead is subjected to gravity, and it oscillates back and forth on 
the wire. (See problem #3 from Chapter 2.) 

(a) Using 9 as a generalized coordinate, calculate the bead's Hamiltonian. 

(b) Obtain Hamilton's canonical equations of motion for the bead. 

2. A particle of mass m moves on a paraboloid of revolution described by the 
equation 

z=-(x 2 + y 2 ), 
a 

where a is a constant. (See the figure for problem #4 from Chapter 2.) The 
particle is subjected to gravity, so that its potential energy is V = mgz. 
Using the cylindrical coordinates p and (f> as generalized coordinates, find 
the Hamiltonian of the particle. [The cylindrical coordinates are defined by 
x = pcoscj), y — psin^.J 

3. A straight frictionless wire is attached at a height h to the z axis, and it makes 
an angle a relative to the z axis. The wire rotates around the z axis with 
a constant angular velocity f2. A bead of mass m slides on the wire and is 
subjected to gravity; it is at a distance r from the point at which the wire is 
attached to the z axis. (See the figure for problem #5 from Chapter 2.) 

(a) Using r as a generalized coordinate, calculate the bead's Hamiltonian. 

(b) Obtain Hamilton's canonical equations of motion for the bead. 

4. A particle of mass m is constrained to move on the surface of a cylinder. The 
cylinder is described in cylindrical coordinates by the equation p = R, where 
p is the distance from the z axis and R is the cylinder's radius. The particle is 
subjected to a force directed toward the origin of the coordinate system and 
proportional to the distance between the particle and the origin; this force is 
described by F = —kr, where k is a constant and r is the particle's position 
vector. (See problem #8 from Chapter 2.) 

(a) Using the cylindrical coordinates z and <j) as generalized coordinates, find 
the particle's Hamiltonian. 

(b) Obtain Hamilton's canonical equations of motion for the particle. Show 
in particular that p,p is a constant of the motion. 

(c) Draw the particle's motion in the reduced phase space spanned by z and 

Pz- 
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5. A pendulum of mass m 2 and length £% is attached to another pendulum of 
mass mi and length l\ (see diagram). The first pendulum is at an angle 0i(t) 
relative to the vertical, while the second pendulum is at an angle ^2 (i) ■ We 
wish to determine the motion of this double pendulum. 




(a) Show that the Lagrangian of the double pendulum is given by 

L = ]^{mi+m 2 )£\9\ + ]^m 2 £\9l+m 2 £i£ 2 9i9 2 cos(8 1 -d 2 ) + (m 1 +m 2 )gii cos 6 1 +m 2 gi2 cos9 2 . 

(b) Calculate the generalized momenta pi and p 2 and express 9\ and 9 2 in 
terms of them. 

(c) Find the Hamiltonian of the double pendulum. 

(d) Show that Hamilton's equations are 

• £ 2 Pi -£ip 2 cos{9 1 - 6 2 ) 



e i £ 2 [m 1 +m 2 sm 2 {6 1 -e 2 )Y 
^ _ {mi + m 2 )£ip 2 - m 2 £ 2 pi cos(gi - 
1 ~ Ji4 [mi +m 2 sin 2 (0i - 6 2 )} 

pi = — A + B — (mi + m^gix sin 6\, 
p 2 = A - B - m 2 g£ 2 sin 9 2l 



where 



and 



A = 



pip 2 sin(#i - 62) 
[mi + m 2 sin 2 (6»i - 9 2 )] 



B 



m 2 t\p\ + (mi + m 2 )£\p\ - 2m 2 £i£ 2 pip 2 cos(#i - 9 2 ) 
^[mi+mj sm 2 (9i-8 2 )\ 2 



sin{9i-9 2 )cos(9i-9 2 ). 
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Appendix A 
Term project: Motion 
around a black hole 



This term project is to be handed in on the last day of classes. While working on this 
project you are permitted to have discussions with your colleagues on any aspect 
of the project. However, and this is important, you are not allowed to directly 
collaborate when carrying out the tasks listed below. You must work through these 
by yourself, independently of anyone else. Cheating will not be tolerated. 

In this term project you will examine the motion of a particle in the strong 
gravitational field of a (nonrotating) black hole. The equations of motion for the 
particle are derived from Einstein's general relativity according to which the particle 
must follow a geodesic — the straightest possible path — in the curved spacetime 
of the black hole. The motion of the particle is represented in polar coordinates by 
its radial position r(t) and its angular position <p(t). Because the gravitational field 
of the black hole is spherically symmetric, the motion takes place within a plane 
(just as in Newtonian theory). 



A.l Equations of motion 

The general relativistic equations of motion are 

; h 



(A.l.l) 



for the angular position, where h is a constant (related to the particle's angular 
momentum), and 

.. GM h 2 ( 3R\ n , A , „x 

r + ^r-z* U-^T =0 (A.1.2) 



2r J 

for the radial position. Here an overdot indicates differentiation with respect to t, 
and 

«=H£M (A ,. 3) 

is the Schwarzschild radius of the black hole. Notice that Eq. (A.l.l) is identical 
to its Newtonian analogue, and that Eq. (A.1.2) is very similar to it. In fact, the 
equations of motion reduce to their Newtonian expressions when R/r <C 1, that is, 
when the particle is very far from the black hole so that the hole's gravitational 
field is very weak. 



Task 1. Calculate R for a black hole whose mass is equal to the Sun's. Then 
calculate R/R@, the ratio of the Sun's Schwarzschild radius R to its actual radius 
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Rq. This number characterizes the size of general relativistic effects in the solar 
system. These effects are small, but they have been measured. 



Task 2. Integrate Eq. (A. 1.2) and express your result in the usual form 

if 2 + v(r) = e, (A. 1.4) 

in which v(r) is the reduced effective potential and e the reduced total mechanical 
energy (which is constant). 



Task 3. Sketch the form of the effective potential v(r) and give a complete 
qualitative description of the possible motions. Be careful: there is a wider range 
of possibilities than in Newtonian theory. When you plot the potential be sure to 
choose values of h that are both above and below the critical value 

h c = V QGMR. (A.1.5) 

Explain what happens to v(r) when h decreases below h c . 



A. 2 Circular orbits 

The equations of motion admit solutions in which r(t) stays constant, r(t) = r = 
constant; these solutions represent circular orbits around the black hole. 



Task 4. For a circular orbit of radius r , calculate its angular momentum h, its 
energy e, and its angular velocity <p. Show that your relativistic results reduce to 
the Newtonian expressions when R/r is very small. 



A. 3 Eccentric orbits 

Eccentric motion is possible when e < 0; there are solutions to the equations of 
motion which describe a particle that moves between two turning points at r = r_ 
and r = r + , where r_ < r + . It is convenient to introduce the same parameterization 
as in Newtonian theory, and to set 

r_ = ^ — = pericentre, r + = ^ - = apoccntre, (A. 3.1) 

where p is an average radius and e an eccentricity. When e = we have that 
r_ = r + = p = ro, and the orbit is circular. 

The results of Task 3 above will have revealed that unlike in Newtonian theory, 
where there are only two turning points, there exists in the relativistic situation 
a third turning point at r = r<. (For this to be true we need e < 0, which was 
already assumed, and h > h c , which is understood.) We have r< < r_, and for our 
purposes this third turning point plays a mathematical role, but has no physical 
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significance. The three turning points are located by setting r = in Eq. (A. 1.4), 
which implies v(r) — e = 0. 

Task 5. Show that v(r) — e has the structure of a cubic polynomial in 1/r. This 
can therefore be presented in the factorized form 

in which k is a constant of proportionality. As required, v(r) — e vanishes whenever 
r becomes equal to either one of r< , r_ , or r + . Use this observation to: 

1. Calculate k. 

2. Express r < in terms of r_, r+, and i?. 

3. Express /i 2 in terms of r_, r + , i?, and GM. 

4. Express e in terms of r_, r+, -R, and GM. 

Finally, clean up these results by substituting Eq. (A.3.1). Show that 

h 2 - u MP 2 , , (A.3.3) 
1- i(3 + e 2 )i?/p 1 ; 

e = - GM (l-e>) \- 2R {v / . (A.3.4) 

Notice that Eqs. (A.3.3) and (A.3.4) reduce to the Newtonian expressions when 
R/p <C 1, as should be expected. 



and 



Task 6. Prove that the condition r< < r_ implies 

p>(3 + e)R. (A.3.5) 

This means that the particle must be at a safe distance away from the black hole 
to be able to keep an eccentric orbit. If this condition is not met the particle will 
be forced to plunge into the hole. 



A. 4 Numerical integration of the equations of 

motion 

A viable strategy to integrate the equations of motion would be to recast Eqs. (A. 1.1) 
and (A. 1.2) as a system of first-order equations, such as r = v, v = —GM/r 2 + 
h 2 (l — 3R/2r)/r 3 , and cb = h/r 2 . One would then select values for e and p, evalu- 
ate h from Eq. (A.3.3), and integrate the equations starting from the initial values 
r(0) = r_ = p/(l + e), v(0) = 0, and 0(0) = 0. From the numerical information 
thus obtained one could then reconstruct the shape of each selected orbit. 

We shall adopt instead an alternative, simpler strategy that will allow us to 
obtain the orbit more directly, at the price of eliminating t from the system of 
equations. We will thus obtain complete shape information, but give up on any 
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temporal information. To formulate this strategy we need to pursue the analytical 
work a bit more. 

It is convenient to introduce an angular parameter \ and to mathematically 
represent the radial part of the motion as 

r( X ) = — ± . (A.4.1) 

1 + e cos x 

We see that as \ proceeds from to tt and then to 2ir, the orbital radius proceeds 
from r_ = p/(l + e) to r + = p/(l — e) and then back to r_; as x ranges through 
the interval < \ < 2tt the particle undergoes what we shall call a complete radial 
orbit. 



Task 7. Combine Eqs. (A. 1.4), (A. 3. 2), and (A.4.1) and derive an expression for 
X- The right-hand side should involve GM, R, p, e, and cos x only. Try to simplify 
this expression as much as possible. 



Task 8. Combine the result of Task 7 with Eqs. (A. 1.1) and (A. 3. 3) and derive 
the equation 

^ = 1 (A.4.2) 

dx yjl- (3 + e cos X )R/p 

This equation shows that x becomes equal to <p in the nonrelativistic limit R/p -C 1. 



Equation (A.4.2) can be numerically integrated for 4>(x)- This, together with 
Eq. (A.4.1), give the exact shape of the relativistic orbit around the black hole. 



Task 9. Using whatever method at your disposal, integrate Eq. (A.4.2) numerically 
and obtain the orbit of the particle. Do this for the following values of p and e: 



orbit 


1: 


p/R = 


5.5, 


e 


= 0.60377; 


orbit 


2: 


p/R = 


4.2, 


e 


= 0.66155; 


orbit 


3: 


p/R = 


3.7, 


e 


= 0.42387; 


orbit 


4: 


p/R = 


3.7, 


e 


= 0.61976; 



orbit 5: your own selected values; 

orbit 6: your own selected values (different from above). 

In the numerical work it is a good idea to measure p (and r) in units of R; this 
eliminates R from all equations. For each case listed above, plot the shape of the 
orbit in the x-y plane. For each case let the parameter \ range through the interval 
< x < 47r, to make sure that the particle undergoes two complete radial orbits. 



Task 10. Provide a summary of your numerical results by listing the following 
quantities in a table: p/R, e, r_/_R, r + /R, and Acj)/(2n). The last quantity, 

A0 _ <Mx = 2tt) - <Hx = 0) ( , A ^ 

27T ~ 27T ' [AA - 6) 
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is the total change in <fr during one complete radial orbit, divided by 2tt; this is 
the number of revolutions that the particle completes during one radial orbit. In 
Newtonian theory this number would always be equal to unity, and the orbit would 
close on itself. In general relativity this number is generally larger than unity, and 
the orbit typically does no close. (For carefully selected sets of orbital parameters, 
the orbit may close after the particle completes a certain number of radial orbits.) 



